Feature/89 rload fn #90

mitzimorris · 2019-07-29T15:55:29Z

Submission Checklist

Run unit tests
Declare copyright holder and open-source license: see below

Summary

Add utility function rload - see issue #89 for discussion.
Test data for unit tests taken from Stan src/test/test-models/good.

Copyright and Licensing

Please list the copyright holder for the work you are submitting (this will be you or your assignee, such as a university or company): Columbia University

By submitting this pull request, the copyright holder is agreeing to license the submitted work under the following licenses:

Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)

ahartikainen

Looks good. Added a few comments

ahartikainen · 2019-07-30T09:37:11Z

cmdstanpy/utils.py

+            *_, vals, dim = rhs.replace('(', ' ').replace(')', ' ').split('c')
+            vals = [float(v) for v in vals.split(',')[:-1]]
+            dim = [int(v) for v in dim.split(',')]
+            val = np.array(vals).reshape(dim[::-1]).T


what is going on here?

Would this equal

np.array(vals).reshape(dim[::-1]).T --> np.array(vals, order='F').reshape(dim, order='F')
or --> np.array(vals).reshape(dim, order='F')
or --> np.reshape(vals, dim, order='F')
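
For reference, a quick check suggests that all of these spellings agree: a C-order reshape to the reversed dimensions followed by a full transpose gives the same array as a column-major (Fortran-order) reshape, which matches how R lays out array values. A minimal sketch, using made-up values and dimensions rather than the PR's test data:

```python
import numpy as np

# Illustrative inputs (not from the PR): 24 values for a 2 x 3 x 4 array.
vals = [float(v) for v in range(1, 25)]
dim = [2, 3, 4]

# The expression under review.
original = np.array(vals).reshape(dim[::-1]).T

# The three alternatives proposed in the review comment.
alt_a = np.array(vals, order='F').reshape(dim, order='F')
alt_b = np.array(vals).reshape(dim, order='F')
alt_c = np.reshape(vals, dim, order='F')

all_equal = all(
    np.array_equal(original, alt) for alt in (alt_a, alt_b, alt_c)
)
print(all_equal)

# In column-major order the first index varies fastest, so stepping the
# first, second, and third index moves 1, 2, and 6 positions through vals.
print(original[1, 0, 0], original[0, 1, 0], original[0, 0, 1])
```

The `order='F'` forms state the intent (column-major fill) directly, which may be easier to read than the reverse-then-transpose idiom.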

mitzimorris · 2019-07-30

Agreed, that's not very clear. I added a subroutine that processes the Rdump multi-dimensional structure; it uses an ugly regex with named groups for the essential bits: the array values and the array dimensions. I ran the Rdump file in R to see what R does, then added unit tests that verify that the array values end up in the right place. Hope this helps.
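
The approach described in this reply could look roughly like the following. This is only a sketch: the pattern `STRUCT_RE` and the helper `parse_structure` are hypothetical names, not the code merged in this PR, and the sketch assumes plain numeric values with no nested parentheses or R integer suffixes such as `2L`.

```python
import re

import numpy as np

# Hypothetical pattern (not the PR's regex): named groups capture the
# comma-separated values and the comma-separated dimensions.
STRUCT_RE = re.compile(
    r"structure\s*\(\s*c\s*\((?P<vals>[^)]*)\)\s*,"
    r"\s*\.Dim\s*=\s*c\s*\((?P<dims>[^)]*)\)\s*\)"
)


def parse_structure(rhs):
    """Parse 'structure(c(...), .Dim = c(...))' into a NumPy array."""
    match = STRUCT_RE.fullmatch(rhs.strip())
    if match is None:
        raise ValueError('not an Rdump structure: {}'.format(rhs))
    vals = [float(v) for v in match.group('vals').split(',')]
    dims = [int(d) for d in match.group('dims').split(',')]
    # R stores arrays in column-major order, hence order='F'.
    return np.array(vals).reshape(dims, order='F')


arr = parse_structure('structure(c(1, 2, 3, 4, 5, 6), .Dim = c(2, 3))')
print(arr)
```

For this input, R would show a 2 x 3 matrix whose first column is 1, 2, so a unit test can assert that `arr[1, 0]` is 2 and `arr[0, 1]` is 3.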

ahartikainen · 2019-07-30T09:46:04Z

cmdstanpy/utils.py

+            idx += 1
+        next_var = idx
+        var_data = ''.join(lines[start_idx:next_var]).replace('\n', '')
+        lhs, rhs = [_.strip() for _ in var_data.split('<-')]


This is fine, just a small comment: using _ instead of item seems weird to me.

mitzimorris · 2019-07-30

Agreed, fixed.
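
A small sketch of the renamed comprehension, wrapped in a hypothetical helper `split_assignment` for illustration (the PR's actual code may differ). Splitting at most once on `<-` is an added assumption so that any later `<-` in the right-hand side is left alone:

```python
def split_assignment(var_data):
    """Split an Rdump assignment 'name <- value' into (name, value)."""
    # 'item' names the loop variable; '_' conventionally marks a value
    # that is never used, which is not the case here.
    lhs, rhs = [item.strip() for item in var_data.split('<-', 1)]
    return lhs, rhs


name, value = split_assignment('N <- 3')
print(name, value)
```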

ahartikainen · 2019-07-30T09:48:25Z

cmdstanpy/utils.py

+    """
+    data_dict = {}
+    with open(fname, 'r') as fp:
+        lines = fp.readlines()


This is fine; larger-than-RAM files are not really that common.


mitzimorris · 2019-07-31T15:01:38Z

I did a rewrite of the rdump parsing logic and plugged it into the read_rdump_metric helper fn. @ahartikainen, if you have any free cycles, please recheck.

(this feature is as much of a PITA as everything else to do with R, but I can see its utility...)

mitzimorris added 10 commits July 18, 2019 21:08

Merge branch 'master' of https://github.com/stan-dev/cmdstanpy

35a604e

updating version

d3b6bf2

opt_lvl 3

54f04a3

Merge branch 'master' of https://github.com/stan-dev/cmdstanpy

35db102

Merge branch 'master' of https://github.com/stan-dev/cmdstanpy

d0e55a0

add rload

34535b6

update version

3175df7

adding unit test, test data

0a2a926

adding unit test, test data

ee95f9a

lint fix

fadddfe

mitzimorris requested review from maedoc and ahartikainen July 29, 2019 15:55

updating docstring, more tests

5af7ed3

ahartikainen reviewed Jul 30, 2019


mitzimorris added 6 commits July 30, 2019 17:30

cleanup rload parse logic

5a41b8b

cleanup rload parse logic

6551a69

cleanup rload parse logic

ac34566

use rload in read_rdump_metric; tests for scalar float, sci notation

1d07295

test data files cleanup

08693c8

code cleanup

70ec077

mitzimorris merged commit a20b051 into master Aug 9, 2019

ahartikainen deleted the feature/89-rload-fn branch August 11, 2019 18:44
