Added support to download example datasets #24

rjoberon · 2024-03-01T14:20:21Z

Dear Sebastian,

Inspired by other libraries (e.g., Seaborn and scikit-learn) I have added support to download exemplary contexts from the repository https://github.com/fcatools/contexts. For more context, please have a look at this post on the FCA mailing list.

We can discuss details on how to properly handle, curate and implement this but I hope you like the idea. I'd happily make you (co)owner of the conexts repo or https://github.com/orgs/fcatools/ such that you have control over what can be loaded.

codecov-commenter · 2024-03-03T17:22:00Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (90214c2) to head (6890a51).

❗ Current head 6890a51 differs from pull request most recent head 2000348. Consider uploading reports for the commit 2000348 to get more accurate results

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@            Coverage Diff            @@
##            master       #24   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           23        24    +1     
  Lines         1462      1471    +9     
=========================================
+ Hits          1462      1471    +9

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

xflr6

Thanks!

xflr6 · 2024-03-03T17:22:16Z

concepts/examples.py

+
+
+# inspired by https://github.com/mwaskom/seaborn/blob/master/seaborn/utils.py#L524
+def load_dataset(name: str, data_src: typing.Optional[str] = DATASET_SOURCE,


nit: How about keyword-only arguments for eveything except name (and maybe that one even positional-only)?

Good idea. I implemented this in the latest commits.

xflr6 · 2024-03-03T17:23:24Z

concepts/examples.py

+
+    # TODO: implement caching here?
+
+    return Context.fromstring(urlopen(url).read().decode(encoding), 'cxt')


Maybe we should use a context-manager for urlopen()?

Definitely. Done.

xflr6 · 2024-03-03T17:23:37Z

tests/test_examples.py

@@ -0,0 +1,10 @@
+import pytest


unused import

xflr6 · 2024-03-03T17:29:38Z

tests/test_examples.py

+
+
+def test_load_dataset():
+    context = concepts.load_dataset('livingbeings_en')


We should probbaly not depend on internet connectivity in the test.

How about having a mocked test and another one doing the actual thing that would be opt-in with a flag?

Something mildy related here: https://github.com/xflr6/graphviz/blob/e5578d39009469df2b7c6743458970643e228226/tests/conftest.py#L5

I agree but I did not manage to implement this. The examples I found are mainly targeting unittest and not pytest (or pytest with the requests module).

xflr6 · 2024-03-03T17:31:32Z

Thanks for the PR.

We can discuss details on how to properly handle, curate and implement this but I hope you like the idea.

Of course :) In recent times, I did not have much time for my open source projects but it would still be nice to make at least some small improvements.

rjoberon added 2 commits March 1, 2024 14:40

added support to download example datasets

1622675

added tests

6890a51

xflr6 reviewed Mar 3, 2024

View reviewed changes

rjoberon added 4 commits March 15, 2024 07:55

made "name" argument positional and all other keyword-only

bcb1dec

using a context manager to open the URL

9426275

removed unused import

861eac1

contextlib not explicitly needed according to documentation

2000348

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added support to download example datasets #24

Added support to download example datasets #24

rjoberon commented Mar 1, 2024

codecov-commenter commented Mar 3, 2024 •

edited

Loading

xflr6 left a comment

xflr6 Mar 3, 2024

rjoberon Mar 15, 2024

xflr6 Mar 3, 2024

rjoberon Mar 15, 2024

xflr6 Mar 3, 2024

rjoberon Mar 15, 2024

xflr6 Mar 3, 2024

rjoberon Mar 15, 2024 •

edited

Loading

xflr6 commented Mar 3, 2024



		# inspired by https://github.com/mwaskom/seaborn/blob/master/seaborn/utils.py#L524
		def load_dataset(name: str, data_src: typing.Optional[str] = DATASET_SOURCE,


		# TODO: implement caching here?

		return Context.fromstring(urlopen(url).read().decode(encoding), 'cxt')



		def test_load_dataset():
		context = concepts.load_dataset('livingbeings_en')

Added support to download example datasets #24

Are you sure you want to change the base?

Added support to download example datasets #24

Conversation

rjoberon commented Mar 1, 2024

codecov-commenter commented Mar 3, 2024 • edited Loading

Codecov Report

xflr6 left a comment

Choose a reason for hiding this comment

xflr6 Mar 3, 2024

Choose a reason for hiding this comment

rjoberon Mar 15, 2024

Choose a reason for hiding this comment

xflr6 Mar 3, 2024

Choose a reason for hiding this comment

rjoberon Mar 15, 2024

Choose a reason for hiding this comment

xflr6 Mar 3, 2024

Choose a reason for hiding this comment

rjoberon Mar 15, 2024

Choose a reason for hiding this comment

xflr6 Mar 3, 2024

Choose a reason for hiding this comment

rjoberon Mar 15, 2024 • edited Loading

Choose a reason for hiding this comment

xflr6 commented Mar 3, 2024

codecov-commenter commented Mar 3, 2024 •

edited

Loading

rjoberon Mar 15, 2024 •

edited

Loading