Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update BNF authority file links #1564

Open
ahankinson opened this issue Apr 19, 2024 · 9 comments
Open

Update BNF authority file links #1564

ahankinson opened this issue Apr 19, 2024 · 9 comments
Assignees

Comments

@ahankinson
Copy link
Contributor

ahankinson commented Apr 19, 2024

The BNF authority links are a bit of a mess.

Some cite the full ARK, e.g., https://muscat.rism.info/admin/institutions/40006238. This doesn't work because the link formatter expects just a number. So it turns it in to https://data.bnf.fr/ark:/12148/cbark:/12148/cb14783478b.

Some records cite just the number, e.g., https://muscat.rism.info/admin/institutions/51005989 gives 15386579.

However, this also does not work, because it's missing the check digit (4) in this case, which is part of the ARK. In other words:

ark:/12148/cb153865794 -> cb is the prefix, 15386579 is the identifier, 4 is the check digit.

But wait! It gets better.

According to this page: https://www.bnf.fr/fr/lidentifiant-ark-archival-resource-key, the BNF have several schemes for their ARK prefixes: dp for digitized documents, cb for catalogue records, and mm for educational resources. I happen to also know that there are others, e.g., ark:/12148/btv1b105513309. The best way to untangle all of these is to go through their ARK resolver, https://ark.bnf.fr.

So, this is just a long-winded way of saying that:

  • I think the BNF identifier in Muscat should be configured to point to https://ark.bnf.fr/. This requires the use of the full ARK in our identifiers, but corresponds to the "Identifier" that the BNF themselves cite on their catalogue pages: https://catalogue.bnf.fr/ark:/12148/cb14783478b -> "Identifiant de la notice", and, if we ever require it, on their Gallica site: https://gallica.bnf.fr/ark:/12148/btv1b8422976g -> "Identifier".
  • the BNF identifiers in Muscat should have the full ARK in them, and not just the "identifier" bit (since it's really hard to figure out the check digit otherwise, and if we don't have the prefix we might be mixing up the different types of records.)

Tagging @Docudoctor, since I've already roped him in to this on Muscat.

@xhero xhero self-assigned this Apr 30, 2024
@xhero
Copy link
Contributor

xhero commented Apr 30, 2024

I can surely change the hardcoded link from https://data.bnf.fr/ark:/12148/cb to to https://ark.bnf.fr/, but how should we go to fix all the identifiers?

@xhero
Copy link
Contributor

xhero commented Apr 30, 2024

I updated the link so it should work in the next release. @alexandermarxen and @Docudoctor could you have a look to see what is the path to fix the data here? We can probably isolate the identifiers that do not start with "ark:" but they will need to be checked by hand.

@alexandermarxen
Copy link

Thank you very much! I'm afraid that there are a lot of records, especially when it comes to persons. Could I have a list of the affected records?

@ahankinson
Copy link
Contributor Author

I think these are the people records that have a bnf identifier, but that identifier doesn't start with "ark". ~2,100.

ark_people.csv

xhero added a commit that referenced this issue Apr 30, 2024
* develop:
  #1089 Rename all the labels! (first try)
  #1564 Update ARK resolver to the proper link
  Fix #1552, add attribution qualifiers
  Fix #1567, add missing mandatory fields
  Fix double escaping of & in xml export
  Clean up
  Implement user selection for folder and comments
  Migrate to new user selection
  Better implementation using flexdatalist
  Basic user select implementation, for #1497
  Allow for unsorted subfield_select_codes values
@alexandermarxen
Copy link

Thank you very much! It's about the order of magnitude I was expecting.

@BaMikusi
Copy link

(FYI: Guido will be able to look into this only next week.)

@Docudoctor
Copy link

Hi!
There are round about 30 institution records with a BNF identifier. Some with ark, some without.
I can easyly find them in "Any field". My question is, how can I cite the BNF identifiers?

@ahankinson
Copy link
Contributor Author

You should include the full ARK, including the ark:/ prefix. So for example:

https://muscat.rism.info/admin/institutions/40006238/edit

This is cited correctly, even though the link does not work. The link will start working after the next Muscat release.

This record, however:

https://muscat.rism.info/admin/institutions/40001144

Is not correct. The BnF identifier should be: ark:/12148/cb12229245w

You can see this in the BnF catalogue: https://catalogue.bnf.fr/ark:/12148/cb12229245w Look under the "Identifiant de la notice" section.

@Docudoctor
Copy link

I know, but I was a little bit confused, because of the change of the hardcoded link. So good luck! :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants