Explicitly populate feature dictionaries with all available features. #1019

Status: Open. Wants to merge 2 commits into base: master.

Conversation

@eleftherioszisis (Collaborator)

This is a PoC of registering beforehand all features that will be available via features.get, instead of implicitly delegating a neurite, morphology, or population (or collections thereof) to the respective feature module when get is called.

The _NEURITE_FEATURES, _MORPHOLOGY_FEATURES, _POPULATION_FEATURES dictionaries are updated following registration so that features from downstream categories (e.g. neurite features) are transformed and registered to upstream categories (e.g. morphology & population feature dictionaries).

If a feature is already registered in upstream categories, because it is defined in the respective module, it is not overwritten and the module definition is used instead.

This PoC allows defining reducible features, i.e. features that can be used by higher-order objects by applying them to their components (e.g. a neurite feature can be applied to the neurites of a morphology or population). If a feature is not reducible, it should be defined in each module (neurite.py, morphology.py, population.py) so that the special logic is used instead.

The advantage of this change is that the available features become explicit and populate the respective feature dictionaries. In this context, features.get only needs to check what type of object is passed and get the respective function from the dictionaries without any special logic.

@eleftherioszisis (Collaborator, Author)

With this change, all available features are listed in the features.get help; see: features.txt

@eleftherioszisis (Collaborator, Author)

@adrien-berchet , @lidakanari , @arnaudon , please feel free to take a look if you have the time. I am not adding you as reviewers yet because it's still a PoC that needs to be discussed.

@adrien-berchet (Member)

It looks nice!
I had a quick look and I am just wondering what happens in the current version for generators. Because with this PR, as far as I can see generators would not be caught by collections.abc.Sequence so it would fail, and I am wondering if it would be a breaking change.

@eleftherioszisis (Collaborator, Author)

> It looks nice! I had a quick look and I am just wondering what happens in the current version for generators. Because with this PR, as far as I can see generators would not be caught by collections.abc.Sequence so it would fail, and I am wondering if it would be a breaking change.

Thanks for checking it out! In the master's implementation, generators are not supported either:

    is_obj_list = isinstance(obj, (list, tuple))
    if not isinstance(obj, (Neurite, Morphology, Population)) and not is_obj_list:
        raise NeuroMError('Only Neurite, Morphology, Population or list, tuple of Neurite,'
                          ' Morphology can be used for feature calculation')

Thus, I have not introduced a new behavior in this implementation.

Supporting generators is an interesting topic. I am not sure it is practical or that there are use cases requiring it, but with a bit of extra logic you could inspect the first element and pass a chain of that element with the rest of the generator into the feature functions.

@adrien-berchet (Member)

Ok!
I wondered about generators because of the Population objects since their __iter__ method returns a generator. So I think we could have the following use case: we iterate over a population using the generator, we change something in the morphology and yield this new morphology on which we want to compute the feature. In this case we would need to support generators. But it's quite specific.

@eleftherioszisis (Collaborator, Author)

> Ok! I wondered about generators because of the Population objects since their __iter__ method returns a generator. So I think we could have the following use case: we iterate over a population using the generator, we change something in the morphology and yield this new morphology on which we want to compute the feature. In this case we would need to support generators. But it's quite specific.

Certainly, we could address this in a separate issue.

@GianlucaFicarelli (Contributor) left a review comment


With this PR the complexity is moved from _get_feature_value_and_func to _transform_downstream_features_to_upstream_feature_categories, but overall it seems cleaner to me.
I'd like to remove the if isinstance chain completely, but that does not seem possible if we need to keep the same API accepting a generic object.


from neurom.core import Population, Morphology, Neurite
from neurom.core.morphology import iter_neurites
from neurom.core.types import NeuriteType, tree_type_checker as is_type
from neurom.utils import flatten
Contributor:

flatten and wraps are unused imports


def _flatten_feature(feature_value, feature_shape):
"""Flattens feature values. Applies for population features for backward compatibility."""
return feature_value if feature_shape == () else reduce(operator.concat, feature_value, [])
Contributor:

This comes from the previous code, but wouldn't flatten/itertools.chain work?
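The reviewer's question can be illustrated with a small comparison. This is a sketch, not the PR's code: both approaches flatten a list of lists, but repeated `operator.concat` copies the accumulator on every step, while `itertools.chain` makes a single pass.

```python
# Sketch: the existing reduce/concat flattening vs. itertools.chain.
# Both produce the same flat list; chain avoids quadratic copying.
import operator
from functools import reduce
from itertools import chain

values = [[1, 2], [3], [4, 5]]

flat_reduce = reduce(operator.concat, values, [])
flat_chain = list(chain.from_iterable(values))

print(flat_reduce)  # → [1, 2, 3, 4, 5]
print(flat_chain)   # → [1, 2, 3, 4, 5]
```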


def _get_neurites_feature_value(feature_, obj, kwargs):
"""Collects neurite feature values appropriately to feature's shape."""
kwargs = deepcopy(kwargs)
Contributor:

I think the code can be simplified if the signature is changed to:

def _get_neurites_feature_value(feature_, obj, /, neurite_type=NeuriteType.all, **kwargs):
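The benefit of the suggested signature can be sketched as follows. The function bodies here are hypothetical stand-ins (the real function iterates neurites); the point is that extracting `neurite_type` as its own keyword argument removes the need to copy the kwargs dict and pop the key out of it.

```python
# Hypothetical sketch: why the suggested keyword-argument signature helps.
DEFAULT = "all"  # stand-in for NeuriteType.all


def get_value_old(feature, obj, kwargs):
    kwargs = dict(kwargs)  # copy so the caller's dict is untouched
    neurite_type = kwargs.pop("neurite_type", DEFAULT)
    return feature, neurite_type, kwargs


def get_value_new(feature, obj, /, neurite_type=DEFAULT, **kwargs):
    # neurite_type arrives separately; no copy/pop bookkeeping needed
    return feature, neurite_type, kwargs


print(get_value_old("f", None, {"neurite_type": "axon", "k": 1}))  # → ('f', 'axon', {'k': 1})
print(get_value_new("f", None, neurite_type="axon", k=1))          # → ('f', 'axon', {'k': 1})
```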

# Update the feature dictionaries so that features from lower categories are transformed and usable
# by upstream categories. For example, a neurite feature will be added to morphology and population
# feature dictionaries, transformed so that it works with the respective objects.
_transform_downstream_features_to_upstream_feature_categories(_FEATURE_CATEGORIES)
Contributor:

Maybe you can try to move most of the content of this file to a different file, keeping here only the needed imports?

    per_neurite_values = (
        feature_(n, **kwargs) for n in iter_neurites(obj, filt=is_type(neurite_type))
    )

    return reduce(operator.add, per_neurite_values, 0 if feature_.shape == () else [])
Contributor:

It seems clearer to me to do something like this (and in general it is more performant):

Suggested change:
-    return reduce(operator.add, per_neurite_values, 0 if feature_.shape == () else [])
+    func = sum if feature_.shape == () else flatten_to_list
+    return func(per_neurite_values)

where flatten_to_list(x) is list(flatten(x))
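The suggestion can be demonstrated on sample values. This is a sketch: `flatten_to_list` is the reviewer's proposed helper, reconstructed here with `itertools.chain` instead of NeuroM's `flatten`, and the sample values are invented.

```python
# Sketch of the suggested change: sum for scalar-shaped features,
# flattening for list-shaped ones.
from itertools import chain


def flatten_to_list(x):
    """Reviewer's proposed helper, approximated with itertools.chain."""
    return list(chain.from_iterable(x))


scalar_values = [3, 5, 2]              # per-neurite scalar feature values
list_values = [[1.0], [2.0, 3.0], []]  # per-neurite list feature values

print(sum(scalar_values))              # → 10
print(flatten_to_list(list_values))    # → [1.0, 2.0, 3.0]
```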

@adrien-berchet adrien-berchet added this to the v4 milestone Feb 22, 2023
@eleftherioszisis eleftherioszisis removed this from the v4 milestone May 2, 2024