UI/Design Change: Renaming Version (.vX) to Something less Confusing. #22
Replies: 1 comment 2 replies
-
I would agree that the current use of version is rather unintuitive. Especially since there's no enforcement that they're actually based from the same data (it is as you said: just based on the "name" of the dataset). It does make some sense that e.g., a binarized version of multiclass classification is "some version of" the same dataset, but I can imagine that many people will consider the more strict association with versioning initially (as would I); primarily to remove errors from the data or maybe in some cases adding additional samples. But then the question becomes: how should we do it? This change will also impact the client libraries (e.g., python allows ( |
Beta Was this translation helpful? Give feedback.
-
Currently, the website/UI has a button for dataset versions (e.g., https://openml.org/search?type=data&sort=version&status=any&order=asc&exact_name=nyc-taxi-green-dec-2016), but as far as I can tell, these are not continuous versions of the same dataset but rather a tag for all datasets with the same name.
This has caused much confusion among researchers and practitioners trying to find source datasets or continuously versioning their datasets.
Furthermore, computer scientists likely associate versions with something akin to a git history.
I propose to rename this feature in the website to something less ambiguous (like name-alikes)
This would be a relatively small change only in the website/UI to avoid many problems in practice. We could still rename it back, once a "real" version history exists.
What do you think? @PGijsbers @joaquinvanschoren
Rel.: openml/openml.org#95
Beta Was this translation helpful? Give feedback.
All reactions