new trait supports-collect-measured #21

untzag · 2023-09-28T21:06:23Z

I'm sure I've made a bunch of small errors, but I think this gets the idea down.

I'd like some comments from the community, and then I'm going to go ahead and implement this in yaqd-core-python.

I believe the Python implementation should be as simple as a mix-in class. Should be pretty easy to add to any existing sensor daemon.

for more information, see https://pre-commit.ci

kameyer226 · 2023-09-28T21:30:59Z

Not sure how this can work. One would need to clear out the spooler/buffer or overwrite old entries. But how would one know when to do that, esp. if multiple clients are open and are polling differently?

untzag · 2023-09-28T23:26:09Z

Great question @kameyer226. Clearing old measurements from memory is entirely at the discretion of the daemon. Each daemon will have some number of measurements cached and when the client sends collect_measured the daemon will do its best to provide the information that client requested. The client has no guarantee of how many measurements the daemon will cache. It's a "best effort" interface.

In a multiclient scenario there's no difference, really. The daemon shouldn't clear the cache just because a client read from it. In this way this is different from the Bluesky Flyer interface.

In the Python implementation I'm imagining the cache implemented using a collections.deque circular buffer with a set size. However I want children to be able to override this behavior.

Of course, this is just the beginning of this suggestion! I'm open to change the design.

untzag · 2023-09-28T23:27:16Z

Note to self, I guess you might need to collect mappings as well. Frustrating because the traits collide a little bit.

I guess collect mapping could be another trait entirely.

ukurumbail · 2023-09-29T17:42:16Z

Sounds great Blaise! A few questions:

Is there a parameter in the daemon to define the rate of data collection from the peripheral device?
How will this work with, for example, auto_rxn? Would the rxn log still provide values at approx. 1 Hz with an additional file for all the intervening values from 'fast' devices?
Cheers!

ddkohler

It seems like this is a good structure for asynchronous applications. All my questions and issues are minor.

ddkohler · 2023-09-29T18:32:09Z

yeps/yep-314.md

+
+This trait adds the method `collect_measured`. This method returns an avro array of arrays, where each inner array contains exactly two items: a unix timestamp float and the associated measurement as an avro map. Bluesky users can think of this as a list of readings.
+
+`collect_measured` accepts one optional argument, `measurement_id`, an integer with default of `null`. If a client provides this argument, the daemon will return only measurements since that id, inclusive. Because each measurement mapping already contains `measurement_id` as specified by the is-sensor trait, there is no ambiguity if clients need to cross-reference to ids. Daemons must account for overflow of `measurement_id`.


What is the standard behavior of collect_measured with no argument? All measurements are returned? The last measurement is returned?

My idea is that all the messages in the cache would be returned. I suppose we could have a second optional argument of max_measurements with default of zero meaning "all".

yeps/yep-314.md

ddkohler · 2023-09-29T18:34:31Z

yeps/yep-314.md

+
+# Motivation
+
+Some experiments incorporate sensors that are very fast or very slow compared to other pieces of hardware being controlled. In such cases it might be natural to let the sensor run asyncronously with other hardware, simply recording each time there is a new sensor reading. Unfortunately, the design of the "get_measured" message defined by is-sensor does not make asyncronus acquistion easy. Clients must ensure that they are polling quickly in order to ensure they don't miss measurements. In extreme cases, sensors are so fast that client polling is simply not practical.


Just to make sure I am understanding the application, this is most useful for when sensors are very fast? Does this trait also offer advantages when sensors are very slow?

I agree it's most useful for very fast things.

For very slow things it's also great, because you just don't have to poll.

It's useful for any measurement that's fundamentally asynchronous with your central datastream.

ddkohler · 2023-09-29T18:35:41Z

yeps/yep-314.md

+
+This trait adds the method `collect_measured`. This method returns an avro array of arrays, where each inner array contains exactly two items: a unix timestamp float and the associated measurement as an avro map. Bluesky users can think of this as a list of readings.
+
+`collect_measured` accepts one optional argument, `measurement_id`, an integer with default of `null`. If a client provides this argument, the daemon will return only measurements since that id, inclusive. Because each measurement mapping already contains `measurement_id` as specified by the is-sensor trait, there is no ambiguity if clients need to cross-reference to ids. Daemons must account for overflow of `measurement_id`.


Consider allowing negative integers as an argument; e.g. collect_measured(-2) will grab the last two measurements?

That would work, as an alternative to my suggestion of a max_measurements argument above...

I think I like it.

yeps/yep-314.md

untzag · 2023-10-19T15:18:00Z

Thanks for your comments @ukurumbail

Is there a parameter in the daemon to define the rate of data collection from the peripheral device?

Unfortunately I have to give the frustrating answer of "maybe". Some daemons will have such a parameter, but many might not. I'd rather leave it up to each daemon, because there are often hardware limitations that make the parameterization of collection rate special.

How will this work with, for example, auto_rxn? Would the rxn log still provide values at approx. 1 Hz with an additional file for all the intervening values from 'fast' devices?

My imagination with auto_rxn is that this would be a separate stream. Just as you imagine it.

Co-authored-by: Daniel Kohler <[email protected]>

Co-authored-by: Kyle Sunden <[email protected]>

ksunden

@untzag feel free to merge with or without the changes discussed regarding negative values and/or semantics of 0

new trait supports-collect-measured

d6954ad

untzag self-assigned this Sep 28, 2023

untzag requested a review from ksunden as a code owner September 28, 2023 21:06

[pre-commit.ci] auto fixes from pre-commit.com hooks

e2c1738

for more information, see https://pre-commit.ci

ddkohler approved these changes Sep 29, 2023

View reviewed changes

ksunden reviewed Oct 19, 2023

View reviewed changes

yeps/yep-314.md Outdated Show resolved Hide resolved

untzag and others added 2 commits October 19, 2023 10:20

Update yeps/yep-314.md

a0c00c4

Co-authored-by: Daniel Kohler <[email protected]>

Apply suggestions from code review

9098e0b

Co-authored-by: Kyle Sunden <[email protected]>

ksunden approved these changes Oct 19, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

new trait supports-collect-measured #21

new trait supports-collect-measured #21

untzag commented Sep 28, 2023

kameyer226 commented Sep 28, 2023

untzag commented Sep 28, 2023

untzag commented Sep 28, 2023 •

edited

Loading

ukurumbail commented Sep 29, 2023

ddkohler left a comment

ddkohler Sep 29, 2023

untzag Oct 19, 2023

ddkohler Sep 29, 2023

untzag Oct 19, 2023

ddkohler Sep 29, 2023

untzag Oct 19, 2023

untzag commented Oct 19, 2023

ksunden left a comment


		This trait adds the method `collect_measured`. This method returns an avro array of arrays, where each inner array contains exactly two items: a unix timestamp float and the associated measurement as an avro map. Bluesky users can think of this as a list of readings.

		`collect_measured` accepts one optional argument, `measurement_id`, an integer with default of `null`. If a client provides this argument, the daemon will return only measurements since that id, inclusive. Because each measurement mapping already contains `measurement_id` as specified by the is-sensor trait, there is no ambiguity if clients need to cross-reference to ids. Daemons must account for overflow of `measurement_id`.


		# Motivation

		Some experiments incorporate sensors that are very fast or very slow compared to other pieces of hardware being controlled. In such cases it might be natural to let the sensor run asyncronously with other hardware, simply recording each time there is a new sensor reading. Unfortunately, the design of the "get_measured" message defined by is-sensor does not make asyncronus acquistion easy. Clients must ensure that they are polling quickly in order to ensure they don't miss measurements. In extreme cases, sensors are so fast that client polling is simply not practical.

new trait supports-collect-measured #21

Are you sure you want to change the base?

new trait supports-collect-measured #21

Conversation

untzag commented Sep 28, 2023

kameyer226 commented Sep 28, 2023

untzag commented Sep 28, 2023

untzag commented Sep 28, 2023 • edited Loading

ukurumbail commented Sep 29, 2023

ddkohler left a comment

Choose a reason for hiding this comment

ddkohler Sep 29, 2023

Choose a reason for hiding this comment

untzag Oct 19, 2023

Choose a reason for hiding this comment

ddkohler Sep 29, 2023

Choose a reason for hiding this comment

untzag Oct 19, 2023

Choose a reason for hiding this comment

ddkohler Sep 29, 2023

Choose a reason for hiding this comment

untzag Oct 19, 2023

Choose a reason for hiding this comment

untzag commented Oct 19, 2023

ksunden left a comment

Choose a reason for hiding this comment

untzag commented Sep 28, 2023 •

edited

Loading