Match with DBLP dataset #21
Replies: 6 comments 1 reply
-
Hi @cverluise, Thanks a lot for kicking this off!
Obviously, the hard match would probably guarantee better quality, right? I assume the coverage between Crossref and DBLP will not be perfect, so relying only on the current match with Crossref may leave some documents behind. At the same time, I guess that the hard match requires an additional effort whose cost I cannot judge at this stage, but you probably have a good idea. Let me know which direction we should take and how I can help. Cheers!
-
Hello @ste210, Thanks for your detailed reply.
If you send me the list of distinct DOIs in DBLP, I can check the intersection very easily. Then we will know which matching option is the most appropriate. My personal email will be perfect.
Thanks for sharing that, this will be highly valuable in the hard matching option.
So, if I'm right, excluding PhD theses, ~82% of the documents have a DOI. Pretty nice. Do you have any idea which kinds of documents systematically lack a DOI? What kind of bias would we introduce by neglecting documents that don't have a DOI? Cheers, Cyril
-
Hello @ste210, thanks for sharing the list of DOIs covered by the DBLP dataset. I looked at the overlap between the DBLP and CrossRef datasets (in terms of DOIs). Here is the baseline:
Reproduce main result
SELECT
count(dblp.doi) as nb_both
FROM
`npl-parsing.external.dblp` as dblp,
`npl-parsing.external.crossref` as crossref
WHERE
LOWER(dblp.doi)=LOWER(crossref.doi)
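For readers without access to the BigQuery tables, the same case-insensitive comparison can be sketched on two plain lists of DOIs. This is only a minimal illustration: the function names and the sample values are hypothetical, and unlike the query above it works on distinct DOIs rather than on joined rows.

```python
def normalize_doi(doi):
    """Lowercase and strip a DOI so that comparisons are case-insensitive."""
    return doi.strip().lower()


def doi_overlap(dblp_dois, crossref_dois):
    """Return (DOIs found in both datasets, DOIs found in DBLP only) as sets."""
    dblp = {normalize_doi(d) for d in dblp_dois if d}
    crossref = {normalize_doi(d) for d in crossref_dois if d}
    return dblp & crossref, dblp - crossref


# Made-up sample values, for illustration only
both, dblp_only = doi_overlap(
    ["10.1145/1666420.1666444", "10.1000/EXAMPLE.1"],
    ["10.1145/1666420.1666444", "10.1111/iwj.12120"],
)
print(len(both), len(dblp_only))
```

The second set is the Python counterpart of the "in DBLP but not in Crossref" list.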
Here is the list of documents that are in DBLP but not in CrossRef: dblp_not_crossref.csv.gz
Soft matching and consolidation
At this point, my understanding is that the soft matching would be a good start. I will improve the documentation with a specific section on "how to join the PatCit dataset with external datasets in a minute" (see #31 raised by @felixpoege).
We might want to add some of the CrossRef/DBLP attributes that cover standard use cases to the baseline PatCit dataset. If you have any ideas on that point, feel free to indicate which fields would be the most valuable to you in #30 (be it from CrossRef or DBLP). My first idea would be to add the DBLP
FYI, we matched 1,176,036 NPL citations with a DOI that is in the DBLP dataset. This corresponds to 254,931 unique DOIs; it appears that many NPLs reported in PatStat are duplicates.
DBLP DOIs matched
SELECT
count(distinct(dblp.doi)) as nb_unique_dois_from_dblp,
count(dblp.doi) as nb_dois_from_dblp
FROM
`npl-parsing.external.dblp` as dblp,
`npl-parsing.patcit.v02_npl` as npl
WHERE LOWER(dblp.doi)=LOWER(npl.doi)
Going further, hard matching
If we want to go further (hard matching) for the remaining 10% of the DBLP DOIs, I think that the way to go would be to fit DBLP into the same JSON schema as the CrossRef data, at least for the fields used in the matching process. This should be relatively easy to do on the basis of the XML version of the database. Of course, this should be double-checked and discussed with the Grobid/biblio-glutton team. Let me know if you think that it is worth it.
Eventually, we could discuss the opportunity to add documents that are in the DBLP dataset but do not have any DOI, using the DBLP internal document id. This might make the aforementioned task more worthwhile. This might be a bit delicate and should be discussed with the Grobid/biblio-glutton team first. Let me know your thoughts.
Hope it helps
-
Hi @cverluise, I could finally look into this in a bit more detail. I wanted to take a few more steps with the data to be able to say more about what would be missed. Indeed, I think the match based on the DOI is a great start with surprisingly good coverage.
I took this list and counted:
1. the publications that would be found in Scopus (as a rough quality screening);
2. the publications that come from conference proceedings (which, as you certainly know, are valuable in computer science);
3. the publications and conferences by conference rank, which comes from a match with the CORE ranking.
This is the result:
Overall, there is a good number of relevant publications, also considering that top conference proceedings are also lower in number in the general data. I thought one alternative could be to extend the match simply based on title and other info when a DOI is not there...? Best, Stefano
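A title-based fallback of the kind suggested here could look roughly like the following. It is only a sketch built on the standard-library `difflib`; the normalization rules, the `0.9` threshold and the function names are assumptions made for illustration and do not describe how biblio-glutton matches records.

```python
import re
from difflib import SequenceMatcher


def normalize_title(title):
    """Lowercase, replace punctuation with spaces and collapse whitespace."""
    title = re.sub(r"[^a-z0-9 ]+", " ", title.lower())
    return " ".join(title.split())


def best_title_match(query_title, candidates, threshold=0.9):
    """Return (candidate, score) for the closest title, or None below threshold."""
    query = normalize_title(query_title)
    best, best_score = None, 0.0
    for candidate in candidates:
        score = SequenceMatcher(None, query, normalize_title(candidate)).ratio()
        if score > best_score:
            best, best_score = candidate, score
    return (best, best_score) if best_score >= threshold else None


# The first title comes from the DBLP sample in this thread
titles = [
    "Computing arbitrary functions of encrypted data.",
    "Matrix metalloproteinases in repair",
]
match = best_title_match("COMPUTING ARBITRARY FUNCTIONS OF ENCRYPTED DATA", titles)
no_match = best_title_match("Completely unrelated", titles)
```

Comparing every citation against every title does not scale to the full dataset, so in practice one would probably first narrow the candidates with other information such as year or first author.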
-
Hello @ste210, sorry for the delay and thanks for the insights.
That should be discussed with the grobid/biblio-glutton team, but I'm almost sure that the hard match is feasible by i) appending the part of the DBLP database that is not covered by Crossref to the Crossref database and ii) making sure that the attributes used by biblio-glutton are properly mapped from the DBLP format to the Crossref JSON format.
Regarding the missing DOI, in the end, this is just a unique identifier from which biblio-glutton gets additional data from PubMed and Unpaywall if possible. So, we could fake it using the DBLP_id or something like that. I might have missed part of the complexity, and I will reach out to the grobid/biblio-glutton team to clarify these points.
If I'm right, then the next step would be to format the DBLP database in the Crossref format, at least for the subset that is not covered by Crossref and the attributes that are used by biblio-glutton. Would you be ready to do that? Cheers, Cyril
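To make step i) and the surrogate identifier idea concrete, here is a rough sketch. It assumes each DBLP record is a dict with a `key` (the DBLP internal id) and an optional `doi`; the function name and the `dblp:` prefix are purely hypothetical and would need to be agreed with the biblio-glutton team.

```python
def records_to_append(dblp_records, crossref_dois):
    """Yield DBLP records not covered by Crossref, each with an identifier.

    Records with a DOI keep it (lowercased); records without one get a
    surrogate built from the DBLP key (hypothetical "dblp:" prefix).
    """
    covered = {doi.lower() for doi in crossref_dois}
    for record in dblp_records:
        doi = (record.get("doi") or "").lower()
        if doi and doi in covered:
            continue  # already in Crossref, nothing to append
        identifier = doi or "dblp:" + record["key"]
        yield {**record, "id": identifier}


# Made-up records, for illustration only
sample = [
    {"key": "journals/cacm/Gentry10", "doi": "10.1145/1666420.1666444"},
    {"key": "conf/example/NoDoi99", "doi": None},
    {"key": "journals/example/Other01", "doi": "10.1000/Other.1"},
]
appended = list(records_to_append(sample, ["10.1145/1666420.1666444"]))
print([record["id"] for record in appended])
```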
-
Hello, as a clarification, what I mean when I say "to format DBLP in CrossRef format" is the following. The DBLP dataset comes as a set of XML objects:
DBLP data sample
<article key="journals/cacm/Gentry10" mdate="2010-04-26">
<author>Craig Gentry</author>
<title>Computing arbitrary functions of encrypted data.</title>
<pages>97-105</pages>
<year>2010</year>
<volume>53</volume>
<journal>Commun. ACM</journal>
<number>3</number>
<ee>http://doi.acm.org/10.1145/1666420.1666444</ee>
<url>db/journals/cacm/cacm53.html#Gentry10</url>
</article>
The Crossref dataset that is consumed by biblio-glutton is a JSONL (newline-delimited JSON) file:
Crossref data sample
{ "_id": { "$oid": "5b907476f1c57925e77f7436" } ,
"indexed": { "date-parts":[[2018 ,
3 ,
17]] ,
"date-time": "2018-03-17T20:59:31Z" ,
"timestamp": { "$numberLong": "1521320371467" } } ,
"reference-count": 53 ,
"publisher": "Wiley-Blackwell" ,
"issue": "4" ,
"license":[{ "URL": "http://doi.wiley.com/10.1002/tdm_license_1.1" ,
"start": { "date-parts":[[2015 ,
9 ,
1]] ,
"date-time": "2015-09-01T00:00:00Z" ,
"timestamp": { "$numberLong": "1441065600000" } } ,
"delay-in-days": 782 ,
"content-version": "tdm" }] ,
"content-domain": { "domain":[] ,
"crossmark-restriction": false } ,
"short-container-title":["Int Wound J"] ,
"published-print": { "date-parts":[[2015 ,
8]] } ,
"DOI": "10.1111/iwj.12120" ,
"type": "journal-article" ,
"created": { "date-parts":[[2013 ,
7 ,
11]] ,
"date-time": "2013-07-11T07:01:43Z" ,
"timestamp": { "$numberLong": "1373526103000" } } ,
"page": "387-396" ,
"source": "Crossref" ,
"is-referenced-by-count": 3 ,
"title":["Adipose-derived stem cells and keratinocytes in a chronic wound cell culture model: the role of hydroxyectoine"] ,
"prefix": "10.1111" ,
"volume": "12" ,
"author":[{ "given": "Oliver C" ,
"family": "Thamm" ,
"sequence": "first" ,
"affiliation":[{ "name": "Clinic for Plastic and Reconstructive Surgery, Handsurgery, Burn Care Center; University of Witten/Herdecke, Cologne-Merheim Medical Center; Köln Germany" }] } ,
{ "given": "Panagiotis" ,
"family": "Theodorou" ,
"sequence": "additional" ,
"affiliation":[{ "name": "Clinic for Plastic and Reconstructive Surgery, Handsurgery, Burn Care Center; University of Witten/Herdecke, Cologne-Merheim Medical Center; Köln Germany" }] } ,
{ "given": "Ewa" ,
"family": "Stuermer" ,
"sequence": "additional" ,
"affiliation":[{ "name": "Institute for Research in Operative Medicine (IFOM); University of Witten/Herdecke; Köln Germany" }] } ,
{ "given": "Max J" ,
"family": "Zinser" ,
"sequence": "additional" ,
"affiliation":[{ "name": "Clinic for Plastic and Reconstructive Surgery, Handsurgery, Burn Care Center; University of Witten/Herdecke, Cologne-Merheim Medical Center; Köln Germany" }] } ,
{ "given": "Edmund A" ,
"family": "Neugebauer" ,
"sequence": "additional" ,
"affiliation":[{ "name": "Institute for Research in Operative Medicine (IFOM); University of Witten/Herdecke; Köln Germany" }] } ,
{ "given": "Paul C" ,
"family": "Fuchs" ,
"sequence": "additional" ,
"affiliation":[{ "name": "Clinic for Plastic and Reconstructive Surgery, Handsurgery, Burn Care Center; University of Witten/Herdecke, Cologne-Merheim Medical Center; Köln Germany" }] } ,
{ "given": "Paola" ,
"family": "Koenen" ,
"sequence": "additional" ,
"affiliation":[{ "name": "Department of Trauma and Orthopedic Surgery; University of Witten/Herdecke, Cologne-Merheim Medical Center; Köln Germany" }] }] ,
"member": "311" ,
"published-online": { "date-parts":[[2013 ,
7 ,
11]] } ,
"reference":[{ "key": "10.1111/iwj.12120-BIB0001|iwj12120-cit-0001" ,
"doi-asserted-by": "crossref" ,
"first-page": "763" ,
"DOI": "10.1111/j.1524-475X.2009.00543.x" ,
"article-title": "Human skin wounds: a major and snowballing threat to public health and the economy" ,
"volume": "17" ,
"author": "Sen" ,
"year": "2009" ,
"journal-title": "Wound Repair Regen" } ,
{ "key": "10.1111/iwj.12120-BIB0002|iwj12120-cit-0002" ,
"first-page": "513" ,
"article-title": "Chemokines in cutaneous wound healing" ,
"volume": "69" ,
"author": "Gillitzer" ,
"year": "2001" ,
"journal-title": "J Leukoc Biol" } ,
{ "key": "10.1111/iwj.12120-BIB0003|iwj12120-cit-0003" ,
"doi-asserted-by": "crossref" ,
"first-page": "434" ,
"DOI": "10.1111/j.1067-1927.2005.130411.x" ,
"article-title": "Early cellular changes of human mesenchymal stem cells and their interaction with other cells" ,
"volume": "13" ,
"author": "Akino" ,
"year": "2005" ,
"journal-title": "Wound Repair Regen" } ,
{ "key": "10.1111/iwj.12120-BIB0004|iwj12120-cit-0004" ,
"doi-asserted-by": "crossref" ,
"first-page": "392" ,
"DOI": "10.1111/j.1524-475X.2000.00392.x" ,
"article-title": "Proteinases, their inhibitors, and cytokine profiles in acute wound fluid" ,
"volume": "8" ,
"author": "Baker" ,
"year": "2000" ,
"journal-title": "Wound Repair Regen" } ,
{ "key": "10.1111/iwj.12120-BIB0005|iwj12120-cit-0005" ,
"doi-asserted-by": "crossref" ,
"first-page": "442" ,
"DOI": "10.1046/j.1524-475X.1999.00442.x" ,
"article-title": "Analysis of the acute and chronic wound environments: the role of proteases and their inhibitors" ,
"volume": "7" ,
"author": "Trengove" ,
"year": "1999" ,
"journal-title": "Wound Repair Regen" } ,
{ "key": "10.1111/iwj.12120-BIB0006|iwj12120-cit-0006" ,
"doi-asserted-by": "crossref" ,
"first-page": "64" ,
"DOI": "10.1111/1523-1747.ep12359590" ,
"article-title": "Wound fluid from chronic leg ulcers contains elevated levels of metalloproteinases MMP-2 and MMP-9" ,
"volume": "101" ,
"author": "Wysocki" ,
"year": "1993" ,
"journal-title": "J Invest Dermatol" } ,
{ "key": "10.1111/iwj.12120-BIB0007|iwj12120-cit-0007" ,
"doi-asserted-by": "crossref" ,
"first-page": "423" ,
"DOI": "10.1046/j.1524-475X.1999.00423.x" ,
"article-title": "Matrix metalloproteinases in repair" ,
"volume": "7" ,
"author": "Parks" ,
"year": "1999" ,
"journal-title": "Wound Repair Regen" } ,
{ "key": "10.1111/iwj.12120-BIB0008|iwj12120-cit-0008" ,
"doi-asserted-by": "crossref" ,
"first-page": "743" ,
"DOI": "10.1111/1523-1747.ep12365637" ,
"article-title": "Wound fluids from human pressure ulcers contain elevated matrix metalloproteinase levels and activity compared to surgical wound fluids" ,
"volume": "107" ,
"author": "Yager" ,
"year": "1996" ,
"journal-title": "J Invest Dermatol" } ,
{ "key": "10.1111/iwj.12120-BIB0009|iwj12120-cit-0009" ,
"doi-asserted-by": "crossref" ,
"first-page": "321" ,
"DOI": "10.1046/j.1524-475X.1996.40307.x" ,
"article-title": "Biochemical analysis of acute and chronic wound environments" ,
"volume": "4" ,
"author": "Tarnuzzer" ,
"year": "1996" ,
"journal-title": "Wound Repair Regen" } ,
{ "key": "10.1111/iwj.12120-BIB0010|iwj12120-cit-0010" ,
"doi-asserted-by": "crossref" ,
"first-page": "226" ,
"DOI": "10.4093/dmj.2011.35.3.226" ,
"article-title": "Comparison of EGF with VEGF non-viral gene therapy for cutaneous wound healing of streptozotocin diabetic mice" ,
"volume": "35" ,
"author": "Ko" ,
"year": "2011" ,
"journal-title": "Diabetes Metab J" } ,
{ "key": "10.1111/iwj.12120-BIB0011|iwj12120-cit-0011" ,
"doi-asserted-by": "crossref" ,
"first-page": "5672" ,
"DOI": "10.1073/pnas.95.10.5672" ,
"article-title": "Neuronal defects and delayed wound healing in mice lacking fibroblast growth factor 2" ,
"volume": "95" ,
"author": "Ortega" ,
"year": "1998" ,
"journal-title": "Proc Natl Acad Sci USA" } ,
{ "key": "10.1111/iwj.12120-BIB0012|iwj12120-cit-0012" ,
"doi-asserted-by": "crossref" ,
"first-page": "1534" ,
"DOI": "10.1016/j.bjps.2008.05.036" ,
"article-title": "Searching for the right timing of surgical delay: angiogenesis, vascular endothelial growth factor and perfusion changes in a skin-flap model" ,
"volume": "62" ,
"author": "Holzbach" ,
"year": "2009" ,
"journal-title": "J Plast Reconstr Aesthet Surg" } ,
{ "key": "10.1111/iwj.12120-BIB0013|iwj12120-cit-0013" ,
"first-page": "522" ,
"article-title": "Roles of cytokines in wound healing processes" ,
"volume": "100" ,
"author": "Ono" ,
"year": "1999" ,
"journal-title": "Nihon Geka Gakkai Zasshi" } ,
{ "key": "10.1111/iwj.12120-BIB0014|iwj12120-cit-0014" ,
"doi-asserted-by": "crossref" ,
"first-page": "189" ,
"DOI": "10.1093/jb/mvr060" ,
"article-title": "Critical role of c-Jun N-terminal kinase in regulating bFGF-induced angiogenesis in vitro" ,
"volume": "150" ,
"author": "Kaikai" ,
"year": "2011" ,
"journal-title": "J Biochem" } ,
{ "key": "10.1111/iwj.12120-BIB0015|iwj12120-cit-0015" ,
"doi-asserted-by": "crossref" ,
"first-page": "150" ,
"DOI": "10.5021/ad.2011.23.2.150" ,
"article-title": "Effects of human adipose-derived stem cells on cutaneous wound healing in nude mice" ,
"volume": "23" ,
"author": "Lee" ,
"year": "2011" ,
"journal-title": "Ann Dermatol" } ,
{ "key": "10.1111/iwj.12120-BIB0016|iwj12120-cit-0016" ,
"doi-asserted-by": "crossref" ,
"first-page": "317" ,
"DOI": "10.1097/SAP.0b013e31817f01b6" ,
"article-title": "Accelerated wound healing in healing-impaired db/db mice by autologous adipose tissue-derived stromal cells combined with atelocollagen matrix" ,
"volume": "62" ,
"author": "Nambu" ,
"year": "2009" ,
"journal-title": "Ann Plast Surg" } ,
{ "key": "10.1111/iwj.12120-BIB0017|iwj12120-cit-0017" ,
"doi-asserted-by": "crossref" ,
"first-page": "503" ,
"DOI": "10.1161/ATVBAHA.108.178962" ,
"article-title": "Cell therapy based on adipose tissue-derived stromal cells promotes physiological and pathological wound healing" ,
"volume": "29" ,
"author": "Ebrahimian" ,
"year": "2009" ,
"journal-title": "Arterioscler Thromb Vasc Biol" } ,
{ "key": "10.1111/iwj.12120-BIB0018|iwj12120-cit-0018" ,
"first-page": "670" ,
"article-title": "Competitive RNA templates for detection and quantitation of growth factors, cytokines, extracellular matrix components and matrix metalloproteinases by RT-PCR" ,
"volume": "20" ,
"author": "Tarnuzzer" ,
"year": "1996" ,
"journal-title": "Biotechniques" } ,
{ "key": "10.1111/iwj.12120-BIB0019|iwj12120-cit-0019" ,
"doi-asserted-by": "crossref" ,
"first-page": "918" ,
"DOI": "10.1046/j.1523-1747.2003.12471.x" ,
"article-title": "Iron and 8-isoprostane levels in acute and chronic wounds" ,
"volume": "121" ,
"author": "Yeoh-Ellerton" ,
"year": "2003" ,
"journal-title": "J Invest Dermatol" } ,
{ "key": "10.1111/iwj.12120-BIB0020|iwj12120-cit-0020" ,
"doi-asserted-by": "crossref" ,
"first-page": "4055" ,
"DOI": "10.1128/jb.170.9.4055-4064.1988" ,
"article-title": "Metabolic regulation in Streptomyces parvulus during actinomycin D synthesis, studied with 13C- and 15 N-labeled precursors by 13C and 15N nuclear magnetic resonance spectroscopy and by gas chromatography-mass spectrometry" ,
"volume": "170" ,
"author": "Inbar" ,
"year": "1988" ,
"journal-title": "J Bacteriol" } ,
{ "key": "10.1111/iwj.12120-BIB0021|iwj12120-cit-0021" ,
"doi-asserted-by": "crossref" ,
"first-page": "61" ,
"DOI": "10.1007/BF00174204" ,
"article-title": "Enzyme stabilization be ectoine-type compatible solutes: protection against heating, freezing and drying" ,
"volume": "37" ,
"author": "Lippert" ,
"year": "1992" ,
"journal-title": "Appl Microbiol Biotechnol" } ,
{ "issue": "1-3" ,
"key": "10.1111/iwj.12120-BIB0022|iwj12120-cit-0022" ,
"doi-asserted-by": "crossref" ,
"first-page": "37" ,
"DOI": "10.1016/j.bpc.2010.02.007" ,
"article-title": "The effect of compatible solute ectoines on the structural organization of lipid monolayer and bilayer membranes" ,
"volume": "150" ,
"author": "Harishchandra" ,
"year": "2010" ,
"journal-title": "Biophys Chem" } ,
{ "key": "10.1111/iwj.12120-BIB0023|iwj12120-cit-0023" ,
"doi-asserted-by": "crossref" ,
"first-page": "315" ,
"DOI": "10.1080/14653240600855905" ,
"article-title": "Minimal criteria for defining multipotent mesenchymal stromal cells. The International Society for Cellular Therapy position statement" ,
"volume": "8" ,
"author": "Dominici" ,
"year": "2006" ,
"journal-title": "Cytotherapy" } ,
{ "key": "10.1111/iwj.12120-BIB0024|iwj12120-cit-0024" ,
"doi-asserted-by": "crossref" ,
"first-page": "52" ,
"DOI": "10.1006/abio.2001.5530" ,
"article-title": "A new quantitative method of real time reverse transcription polymerase chain reaction assay based on simulation of polymerase chain reaction kinetics" ,
"volume": "302" ,
"author": "Liu" ,
"year": "2002" ,
"journal-title": "Anal Biochem" } ,
{ "key": "10.1111/iwj.12120-BIB0025|iwj12120-cit-0025" ,
"doi-asserted-by": "crossref" ,
"first-page": "125" ,
"DOI": "10.1111/1523-1747.ep12276531" ,
"article-title": "Spreading and enhanced motility of human keratinocytes on fibronectin" ,
"volume": "85" ,
"author": "O'Keefe" ,
"year": "1985" ,
"journal-title": "J Invest Dermatol" } ,
{ "key": "10.1111/iwj.12120-BIB0026|iwj12120-cit-0026" ,
"doi-asserted-by": "crossref" ,
"first-page": "447" ,
"DOI": "10.1046/j.1524-4725.2000.99281.x" ,
"article-title": "Level of fibronectin mRNA is markedly increased in human chronic wounds" ,
"volume": "26" ,
"author": "Ongenae" ,
"year": "2000" ,
"journal-title": "Dermatol Surg" } ,
{ "key": "10.1111/iwj.12120-BIB0027|iwj12120-cit-0027" ,
"doi-asserted-by": "crossref" ,
"first-page": "410" ,
"DOI": "10.1111/1523-1747.ep12499839" ,
"article-title": "Degradation of fibronectin and vitronectin in chronic wound fluid: analysis by cell blotting, immunoblotting, and cell adhesion assays" ,
"volume": "98" ,
"author": "Grinnell" ,
"year": "1992" ,
"journal-title": "J Invest Dermatol" } ,
{ "key": "10.1111/iwj.12120-BIB0028|iwj12120-cit-0028" ,
"doi-asserted-by": "crossref" ,
"first-page": "185" ,
"DOI": "10.1111/j.1742-481X.2012.00967.x" ,
"article-title": "Chronic wounds - is cellular 'reception' at fault? Examining integrins and intracellular signalling" ,
"volume": "10" ,
"author": "Widgerow" ,
"year": "2013" ,
"journal-title": "Int Wound J" } ,
{ "key": "10.1111/iwj.12120-BIB0029|iwj12120-cit-0029" ,
"doi-asserted-by": "crossref" ,
"first-page": "519" ,
"DOI": "10.1016/j.clindermatol.2010.03.009" ,
"article-title": "Chronic wound infection: facts and controversies" ,
"volume": "28" ,
"author": "Siddiqui" ,
"year": "2010" ,
"journal-title": "Clin Dermatol" } ,
{ "key": "10.1111/iwj.12120-BIB0030|iwj12120-cit-0030" ,
"doi-asserted-by": "crossref" ,
"first-page": "45" ,
"DOI": "10.1111/j.1524-475X.2007.00290.x" ,
"article-title": "Inhibition of keratinocyte migration by lipopolysaccharide" ,
"volume": "16" ,
"author": "Loryman" ,
"year": "2008" ,
"journal-title": "Wound Repair Regen" } ,
{ "key": "10.1111/iwj.12120-BIB0031|iwj12120-cit-0031" ,
"doi-asserted-by": "crossref" ,
"first-page": "1569" ,
"DOI": "10.1038/jid.2008.405" ,
"article-title": "Chemokine-mediated migration of skin-derived stem cells: predominant role for CCL5/RANTES" ,
"volume": "129" ,
"author": "Kroeze" ,
"year": "2009" ,
"journal-title": "J Invest Dermatol" } ,
{ "key": "10.1111/iwj.12120-BIB0032|iwj12120-cit-0032" ,
"doi-asserted-by": "crossref" ,
"first-page": "419" ,
"DOI": "10.1007/s00403-009-1011-1" ,
"article-title": "Protease and pro-inflammatory cytokine concentrations are elevated in chronic compared to acute wounds and can be modulated by collagen type I in vitro" ,
"volume": "302" ,
"author": "Wiegand" ,
"year": "2010" ,
"journal-title": "Arch Dermatol Res" } ,
{ "key": "10.1111/iwj.12120-BIB0033|iwj12120-cit-0033" ,
"doi-asserted-by": "crossref" ,
"first-page": "1183" ,
"DOI": "10.1007/s00109-008-0378-3" ,
"article-title": "IKK-2 is required for TNF-alpha-induced invasion and proliferation of human mesenchymal stem cells" ,
"volume": "86" ,
"author": "Bocker" ,
"year": "2008" ,
"journal-title": "J Mol Med (Berl)" } ,
{ "key": "10.1111/iwj.12120-BIB0034|iwj12120-cit-0034" ,
"doi-asserted-by": "crossref" ,
"first-page": "181" ,
"DOI": "10.1046/j.1524-475X.1993.10308.x" ,
"article-title": "Inhibition of cell proliferation by chronic wound fluid" ,
"volume": "1" ,
"author": "Bucalo" ,
"year": "1993" ,
"journal-title": "Wound Repair Regen" } ,
{ "key": "10.1111/iwj.12120-BIB0035|iwj12120-cit-0035" ,
"doi-asserted-by": "crossref" ,
"first-page": "734" ,
"DOI": "10.1016/S0741-5214(99)70113-8" ,
"article-title": "The proliferative capacity of neonatal skin fibroblasts is reduced after exposure to venous ulcer wound fluid: a potential mechanism for senescence in venous ulcers" ,
"volume": "30" ,
"author": "Mendez" ,
"year": "1999" ,
"journal-title": "J Vasc Surg" } ,
{ "key": "10.1111/iwj.12120-BIB0036|iwj12120-cit-0036" ,
"doi-asserted-by": "crossref" ,
"first-page": "603" ,
"DOI": "10.1126/science.2683075" ,
"article-title": "G1 events and regulation of cell proliferation" ,
"volume": "246" ,
"author": "Pardee" ,
"year": "1989" ,
"journal-title": "Science" } ,
{ "key": "10.1111/iwj.12120-BIB0037|iwj12120-cit-0037" ,
"doi-asserted-by": "crossref" ,
"first-page": "466" ,
"DOI": "10.1111/j.0022-202X.2004.23557.x" ,
"article-title": "Chronic wound fluid suppresses proliferation of dermal fibroblasts through a Ras-mediated signaling pathway" ,
"volume": "124" ,
"author": "Seah" ,
"year": "2005" ,
"journal-title": "J Invest Dermatol" } ,
{ "key": "10.1111/iwj.12120-BIB0038|iwj12120-cit-0038" ,
"first-page": "170" ,
"article-title": "The anaerobic and aerobic microbiology of wounds: a review" ,
"volume": "10" ,
"author": "Bowler" ,
"year": "1998" ,
"journal-title": "Wounds" } ,
{ "key": "10.1111/iwj.12120-BIB0039|iwj12120-cit-0039" ,
"doi-asserted-by": "crossref" ,
"first-page": "603" ,
"DOI": "10.1111/1523-1747.ep12583377" ,
"article-title": "Staphylococcal toxins and protein A differentially induce cytotoxicity and release of tumor necrosis factor-alpha from human keratinocytes" ,
"volume": "107" ,
"author": "Ezepchuk" ,
"year": "1996" ,
"journal-title": "J Invest Dermatol" } ,
{ "key": "10.1111/iwj.12120-BIB0040|iwj12120-cit-0040" ,
"doi-asserted-by": "crossref" ,
"first-page": "165" ,
"DOI": "10.1016/j.phrs.2008.06.004" ,
"article-title": "Oxidative stress in normal and impaired wound repair" ,
"volume": "58" ,
"author": "Schafer" ,
"year": "2008" ,
"journal-title": "Pharmacol Res" } ,
{ "key": "10.1111/iwj.12120-BIB0041|iwj12120-cit-0041" ,
"first-page": "473" ,
"article-title": "Oxygen free radicals and wound healing" ,
"volume": "17" ,
"author": "White" ,
"year": "1990" ,
"journal-title": "Clin Plast Surg" } ,
{ "key": "10.1111/iwj.12120-BIB0042|iwj12120-cit-0042" ,
"doi-asserted-by": "crossref" ,
"first-page": "516" ,
"DOI": "10.1097/00000637-199711000-00012" ,
"article-title": "Oxygen free radicals impair wound healing in ischemic rat skin" ,
"volume": "39" ,
"author": "Senel" ,
"year": "1997" ,
"journal-title": "Ann Plast Surg" } ,
{ "key": "10.1111/iwj.12120-BIB0043|iwj12120-cit-0043" ,
"doi-asserted-by": "crossref" ,
"first-page": "61" ,
"DOI": "10.1007/s00411-007-0096-1" ,
"article-title": "UVB induced oxidative stress in human keratinocytes and protective effect of antioxidant agents" ,
"volume": "46" ,
"author": "Jin" ,
"year": "2007" ,
"journal-title": "Radiat Environ Biophys" } ,
{ "key": "10.1111/iwj.12120-BIB0044|iwj12120-cit-0044" ,
"doi-asserted-by": "crossref" ,
"first-page": "e13839" ,
"DOI": "10.1371/journal.pone.0013839" ,
"article-title": "Antioxidants protect keratinocytes against M. ulcerans mycolactone cytotoxicity" ,
"volume": "5" ,
"author": "Gronberg" ,
"year": "2010" ,
"journal-title": "PLoS One" } ,
{ "key": "10.1111/iwj.12120-BIB0045|iwj12120-cit-0045" ,
"doi-asserted-by": "crossref" ,
"first-page": "380" ,
"DOI": "10.1111/j.1751-1097.1999.tb08239.x" ,
"article-title": "Apoptosis, the role of oxidative stress and the example of solar UV radiation" ,
"volume": "70" ,
"author": "Pourzand" ,
"year": "1999" ,
"journal-title": "Photochem Photobiol" } ,
{ "issue": "6 Suppl" ,
"key": "10.1111/iwj.12120-BIB0046|iwj12120-cit-0046" ,
"first-page": "s8" ,
"article-title": "In vivo skin antioxidant effect of a new combination based on a specific Vitis vinifera shoot extract and a biotechnological extract" ,
"volume": "6" ,
"author": "Cornacchione" ,
"year": "2007" ,
"journal-title": "J Drugs Dermatol" } ,
{ "key": "10.1111/iwj.12120-BIB0047|iwj12120-cit-0047" ,
"doi-asserted-by": "crossref" ,
"first-page": "347" ,
"DOI": "10.1016/j.jss.2008.04.023" ,
"article-title": "The role of vascular endothelial growth factor in wound healing" ,
"volume": "153" ,
"author": "Bao" ,
"year": "2009" ,
"journal-title": "J Surg Res" } ,
{ "key": "10.1111/iwj.12120-BIB0048|iwj12120-cit-0048" ,
"doi-asserted-by": "crossref" ,
"first-page": "1935" ,
"DOI": "10.1016/S0002-9440(10)63754-6" ,
"article-title": "Topical vascular endothelial growth factor accelerates diabetic wound healing through increased angiogenesis and by mobilizing and recruiting bone marrow-derived cells" ,
"volume": "164" ,
"author": "Galiano" ,
"year": "2004" ,
"journal-title": "Am J Pathol" } ,
{ "key": "10.1111/iwj.12120-BIB0049|iwj12120-cit-0049" ,
"doi-asserted-by": "crossref" ,
"first-page": "539" ,
"DOI": "10.1097/01.prs.0000173447.81513.7a" ,
"article-title": "Effect of chronic wound exudates and MMP-2/-9 inhibitor on angiogenesis in vitro" ,
"volume": "116" ,
"author": "Ulrich" ,
"year": "2005" ,
"journal-title": "Plast Reconstr Surg" } ,
{ "key": "10.1111/iwj.12120-BIB0050|iwj12120-cit-0050" ,
"doi-asserted-by": "crossref" ,
"first-page": "26" ,
"DOI": "10.1046/j.1524-475X.2002.10903.x" ,
"article-title": "Ratios of activated matrix metalloproteinase-9 to tissue inhibitor of matrix metalloproteinase-1 in wound fluids are inversely correlated with healing of pressure ulcers" ,
"volume": "10" ,
"author": "Ladwig" ,
"year": "2002" ,
"journal-title": "Wound Repair Regen" } ,
{ "issue": "6-7" ,
"key": "10.1111/iwj.12120-BIB0051|iwj12120-cit-0051" ,
"doi-asserted-by": "crossref" ,
"first-page": "1334" ,
"DOI": "10.1016/j.biocel.2007.10.024" ,
"article-title": "Metalloproteinases and their inhibitors: regulators of wound healing" ,
"volume": "40" ,
"author": "Gill" ,
"year": "2008" ,
"journal-title": "Int J Biochem Cell Biol" } ,
{ "key": "10.1111/iwj.12120-BIB0052|iwj12120-cit-0052" ,
"doi-asserted-by": "crossref" ,
"first-page": "191" ,
"DOI": "10.1007/s007920050116" ,
"article-title": "Extrinsic protein stabilization by the naturally occurring osmolytes beta-hydroxyectoine and betaine" ,
"volume": "3" ,
"author": "Knapp" ,
"year": "1999" ,
"journal-title": "Extremophiles" } ,
{ "key": "10.1111/iwj.12120-BIB0053|iwj12120-cit-0053" ,
"doi-asserted-by": "crossref" ,
"first-page": "82" ,
"DOI": "10.1186/1472-6750-7-82" ,
"article-title": "Compatible solutes from hyperthermophiles improve the quality of DNA microarrays" ,
"volume": "7" ,
"author": "Mascellani" ,
"year": "2007" ,
"journal-title": "BMC Biotechnol" }] ,
"container-title":["International Wound Journal"] ,
"link":[{ "URL": "https://api.wiley.com/onlinelibrary/tdm/v1/articles/10.1111%2Fiwj.12120" ,
"content-type": "unspecified" ,
"content-version": "vor" ,
"intended-application": "text-mining" }] ,
"deposited": { "date-parts":[[2017 ,
6 ,
21]] ,
"date-time": "2017-06-21T16:04:49Z" ,
"timestamp": { "$numberLong": "1498061089000" } } ,
"score": 1 ,
"subtitle":["Effects of hydroxyectoine on adipose-derived stem cells and keratinocytes in chronic wounds"] ,
"issued": { "date-parts":[[2013 ,
7 ,
11]] } ,
"references-count": 53 ,
"journal-issue": { "issue": "4" } ,
"URL": "http://dx.doi.org/10.1111/iwj.12120" ,
"relation": { "cites":[] } ,
"ISSN":["1742-4801"] ,
"issn-type":[{ "value": "1742-4801" ,
"type": "print" }] ,
"subject":["Surgery" ,
"Dermatology"] }
So fitting the DBLP data to Crossref basically means reorganizing the DBLP XML objects in the same way as the Crossref JSON objects. Note that only a subset of the attributes (to be defined with the grobid/biblio-glutton team) is used in the matching process (something like title, journal title, first author, date). Cheers!
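As an illustration of the reorganization described in this comment, the sketch below maps the DBLP sample onto a few Crossref-style field names. The choice of fields, the mapping of `journal` to `short-container-title`, the naive author name split and all function names are assumptions; the actual subset still has to be defined with the grobid/biblio-glutton team.

```python
import json
import xml.etree.ElementTree as ET

DBLP_SAMPLE = """<article key="journals/cacm/Gentry10" mdate="2010-04-26">
<author>Craig Gentry</author>
<title>Computing arbitrary functions of encrypted data.</title>
<pages>97-105</pages>
<year>2010</year>
<volume>53</volume>
<journal>Commun. ACM</journal>
<number>3</number>
<ee>http://doi.acm.org/10.1145/1666420.1666444</ee>
<url>db/journals/cacm/cacm53.html#Gentry10</url>
</article>"""


def split_name(full_name):
    """Naive split: the last token is taken as the family name (an assumption)."""
    given, _, family = full_name.rpartition(" ")
    return {"given": given, "family": family}


def doi_from_ee(ee):
    """Extract a DOI from an <ee> URL, or return None if it contains none."""
    if ee and "/10." in ee:
        return "10." + ee.split("/10.", 1)[1]
    return None


def dblp_to_crossref(xml_string):
    """Map one DBLP XML record to a dict with Crossref-style field names."""
    node = ET.fromstring(xml_string)
    authors = [split_name(a.text) for a in node.findall("author")]
    for position, author in enumerate(authors):
        author["sequence"] = "first" if position == 0 else "additional"
    year = node.findtext("year")
    record = {
        "DOI": doi_from_ee(node.findtext("ee")),
        "type": "journal-article" if node.tag == "article" else node.tag,
        "title": [node.findtext("title")],
        "short-container-title": [node.findtext("journal")],
        "author": authors,
        "volume": node.findtext("volume"),
        "issue": node.findtext("number"),
        "page": node.findtext("pages"),
        "issued": {"date-parts": [[int(year)]]} if year else None,
    }
    # Drop the fields that this DBLP record does not provide
    return {k: v for k, v in record.items() if v not in (None, [None])}


crossref_like = dblp_to_crossref(DBLP_SAMPLE)
print(json.dumps(crossref_like))  # one such line per record gives a JSONL file
```

Note that the full DBLP dump relies on character entities declared in its DTD, so parsing it as a whole would need more care than this single-record example.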
-
Hello @ste210,
I am following up on our discussion about matching bibliographical references with the DBLP data.
Let's start the discussion here to better understand what can be done and what the added value of this additional matching would be. I think you (@ste210) can help us understand those aspects a lot better.
Before going any further, some basic points to level the playing field and make sure that anyone can join the discussion:
Questions and feature ideas
To what extent does the DBLP dataset overlap/complement the Crossref dataset? Can you quantify that?
This is an important question because it determines what we call "matching".
In case we are in the hard matching setting, my understanding is that this research direction might well lie upstream of PatCit. I started discussing it with @kermitt2, who created biblio-glutton, the bibliographical matching service we rely upon.
In this case, it would be very useful if you could give us the schema (variables, description and ideally number of non-NA occurrences) of the DBLP database. This will help us better understand what is feasible. In particular, what is the frequency of documents with a unique identifier (e.g. DOI)?
NB: for long text or tables, you can use the <details>...</details> HTML tags to wrap it up.
Hope we can move forward fast on this important question!
Cheers!