Until recently, CTrees were generated either locally or through getpapers or quickscrape. The automatically generated files contain at least one reserved file, such as fulltext.pdf, and the CMine software used this to determine which directories in a CProject are actually CTrees. This was always recognised to be a heuristic, and recently, with bulk download of metadata from Crossref, we see many potential CTrees without reserved files or even without any files. Here's a simple example:
The first directory was retrieved by quickscrape from EPMC, and the heuristics identify it as a potential CTree. The other two were retrieved by getpapers from Crossref followed by quickscrape, which creates only metadata; they are currently not flagged as CTrees. The empty directory is created (I think) by quickscrape, which then fails to retrieve anything.
The original motivation for the heuristics is that we may introduce new reserved directories into a CProject, and users might also introduce non-CTree directories. There was also the idea of having a reserved file (e.g. metadata.json or log.xml) in every CTree directory. At present I favour this, and we should discuss what is in it.
[See also https://github.com//issues/10 ]
Currently I have added a switch which allows users to toggle this behaviour. I will also add results.json to the reserved files which flag "CTree-ness".
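To make the discussion concrete, here is a minimal Python sketch of the reserved-file heuristic. It is not the CMine implementation. The function names, the directory names, and the assumption that the metadata-only directories contain a results.json are all hypothetical, chosen only to mirror the cases described above.

```python
import tempfile
from pathlib import Path

# Assumed default: only fulltext.pdf is named in this issue as a reserved file;
# the real list in CMine is presumably longer.
RESERVED_FILES = frozenset({"fulltext.pdf"})


def is_ctree(directory, reserved=RESERVED_FILES):
    """Heuristic: a directory counts as a CTree if it directly contains
    at least one file whose name is in the reserved set."""
    directory = Path(directory)
    if not directory.is_dir():
        return False
    return any(
        child.is_file() and child.name in reserved
        for child in directory.iterdir()
    )


def find_ctrees(cproject, reserved=RESERVED_FILES):
    """Return the sorted names of the child directories flagged as CTrees."""
    return sorted(
        d.name for d in Path(cproject).iterdir() if is_ctree(d, reserved)
    )


def build_example_project(root):
    """Create a toy CProject with hypothetical directory names."""
    root = Path(root)
    layout = {
        "PMC0000001": ["fulltext.pdf"],  # full text retrieved
        "crossref_a": ["results.json"],  # metadata only (assumed file name)
        "crossref_b": ["results.json"],  # metadata only (assumed file name)
        "empty": [],                     # retrieval failed, nothing written
    }
    for name, files in layout.items():
        (root / name).mkdir()
        for filename in files:
            (root / name / filename).touch()
    return root


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        project = build_example_project(tmp)
        # Current heuristic: only the directory with fulltext.pdf is flagged.
        print(find_ctrees(project))
        # With results.json added to the reserved set, the metadata-only
        # directories are flagged as well; the empty directory still is not.
        print(find_ctrees(project, RESERVED_FILES | {"results.json"}))
```

Under these assumptions, extending the reserved set picks up the metadata-only directories, but an empty directory can never be recognised by file names alone. That is one argument for a dedicated marker file in every CTree.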