-
-
Notifications
You must be signed in to change notification settings - Fork 183
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Google Scholar meta data #2191
Conversation
Should we not use a sequence number for |
Also, should we include the |
There doesn't seem to be any guidance on whether this needs to be a number or not (unless you have a reference for this?), and even it does, the fact that the year is a number works for me and is easier to know that starting some other unique sequence number to track. Bigger question is what to do about the language?
I deliberately didn't because we don't have a unique PDF URL per chapter and they say:
|
There might not be a Google Scholar guidance on it, but this is an established terminology for periodical literature:
I would omit the language aspect from volume. That metadata can be provided using some other means, if necessary. |
Should we consider permalinking these? Because Chapter PDFs seem like an important outcome which should be discoverable from a stable URI. |
Perhaps we should just omit these two fields if we're not sure? They are not mandatory. Concern with using a volume — without the language — is that many languages have the publication title just as Some PDFs viewers allow deep linking (e.g. this is a deep link that opens the JavaScript chapter in Chrome's built-in PDF viewer) but many don't, and not sure how Google Scholar handles these. I don't think it's right to include the same PDF across all the chapters, and this is optional, so definitely don't think we should include this link. @nrllh you seem to be familiar with these, any thoughts? |
wI would not pollute the title with language information. There should be other means to provide that info. The Internet Archive Scholar has support for searching literature in specific language.
I am thinking, can we use
I was thinking along the lines of publishing the whole book as well as each chapter separately as well, which is in line with how academic publications are distributed. Moreover, page numbers can also be specified in the bibliography, which is also a standard practice. |
I'm not seeing the benefit of having the chapter as individual PDFs to be honest. To me the Web Almanac works best as a website. It has links (including to raw data, queries and comments), interactive figures, an interactive ToC, is responsive and font sizes can be increased or decreased as needed be. Additionally figure descriptions can be shown/hidden and there's navigation to jump around the site. The Ebook PDF is basically the offline version, but it is a lesser experience for that. But still it's good because some (like me!) like the fact you can print as a book so that makes sense. A PDF of a single chapter serves no real purpose to be honest and is worst of both worlds, and not sure I want to manage another version to be honest. I do see the value of getting these listed in a Google Scholar as do considered them well-researched scholarly documents that have been referenced in the past (and hopefully will continue to be). But Google Scholar is happy with HTML so why do you think we need PDFs? Readers can always print to PDF if they want to. Can you give me the compelling use case for chapter PDFs? |
I am more than okay to not have chapter-specific PDFs. Distill is a good example of interactive HTML-only publication. :) |
The more I think about it, the more I think we should only list the English version. That's the official document to quote for references. At least until there's a better supported So in 98f0f6b I restricted to English, changed the It now looks like this:
|
Loosely held opinions, summoned by Sawood: This work seems more like a "technical report", not "journal article". Unless you intend "to be a journal" in the long run, and register an ISSN, I would skip the volume, issue, and journal name fields, and just use The Regarding HTML, PDF, and chapters, seems up to you. Folks can cite individual chapters on their own. It is common in academia to catalog both a full book/encyclopedia/manuscript and the individual chapters, but that is usually because such books frequently have distinct authors for the individual chapters, and to be honest the books are really just collections of papers or essays. It also makes it easier to distribute chapters for course assignments, or to (sigh) sell access to individual chapters instead of the whole book. I would at least have a table of contents with anchor URLs for individual chapters. I wouldn't bother having separate PDFs for each chapter unless you also make separate HTML documents / landing pages, but sort of subjective. I would recommend making a PDF of the whole work available, with page numbers as a lot of folks will still print out subsets or the whole thing for reading and marking up. For better or worse, a lot of folks who read a lot simply have technical or personal workflows that assume a PDF and/or ePub. In your date field, I would use "ISO" style date, with dashes not slashes (eg, |
Thanks for the feedback @bnewbold ! Comments inline to make
For your background, this is an annual report on web practices produced by a collection of over 100 people and compromising 20+ chapters. We have so far published a 2019 and 2020 edition and have started work on the 2021 edition. So not just a once-off report. Each chapter looks at a facet of web development (CSS, Javascript, Accessibility, Security...etc.) and is a well researched and data-backed analysis of the state of that chapter, by mining the HTTP Archive (crawls of over 7.5 million website home pages) and other resources. Each chapter is produced by separate authors (often well-known experts in each field) and data analysts. It was launched in late 2019 as a collection of 20 web pages for each chapter. Shortly afterwards we published a full PDF of the entire 2019 edition as an ebook download. Chapters of the report have been cited in multiple other scholarly article and mostly they cite the chapter in question. For example a web performance research article may cite the 2019 HTTP/2 chapter. Not sure if that influences your opinion on whether we should use Volume and Issue or not?
Interesting! Yes it seems from a quick search that there's no reference document for Highwire Press or other similar tag formats 😞 So have been concentrating on Google Scholar for now Still in two minds on adding meta tags for our translations though. As I say, the report is produced in English and then translated to 10+ languages - often only in part as we depend on volunteers for the translation. Only the 2019 Japanese edition is a complete translation and has therefore also been launched as a PDF Ebook download. Though a couple of languages (Japanese and Dutch) are close to finishing 2020 translation at which point they will also be made available as ebook PDF downloads. So to me the main references are the original English chapters.
As stated above, current citations have mostly been at chapter level. The website has full navigation options, including a table of contents page for each language and year, and a separate chapter table of contents for each chapter. The HTML pages are printable thanks to a print stylesheet hiding unnecessary content like headers and footer, but only the whole edition is made available as a downloadable PDF.
Yup already done, for 2019 (English), and 2020 (English), and also for 2019 (Japanese).
Google Scholar specifically states that format must be used:
|
Makes progress on #1325
I've added the meta data to the individual chapters, rather than the eBook as most citations are for a chapter with URL reference. This also avoids the issue of making the ebook HTML indexable (it's currently set to
noindex
as we prefer the PDF to be indexed, and it's VERY big to be downloaded regularly).Example of meta data:
** Edit: Updated as discussed below, see #2191 (comment) for the latest version **
Any thought on
citation_volume
? Should we include the Language? Or should we only have this Metadata for English language versions as they are the primary one and Google Scholar doesn't seem to have many options for translations, orcitation_language
or similar.