At Crossref and ROR, we develop and run processes that match metadata at scale, creating relationships between millions of entities in the scholarly record. Over the last few years, we’ve spent a lot of time diving into details about metadata matching strategies, evaluation, and integration. It is quite possibly our favourite thing to talk and write about! But sometimes it is good to step back and look at the problem from a wider perspective.
This year’s public data file is now available, featuring over 156 million metadata records deposited with Crossref through the end of April 2024 from over 19,000 members. A full breakdown of Crossref metadata statistics is available here.
Like last year, you can download all of these records in one go via Academic Torrents or directly from Amazon S3 via the “requester pays” method.
Download the file: The torrent download can be initiated here.
Earlier this year, we reported on the roundtable discussion event that we had organised in Frankfurt on the heels of the Frankfurt Book Fair 2023. This event was the second in the series of roundtable events that we are holding with our community to hear from you how we can all work together to preserve the integrity of the scholarly record - you can read more about insights from these events and about ISR in this series of blogs.
Crossref is undertaking a large program, dubbed 'RCFS' (Resourcing Crossref for Future Sustainability) that will initially tackle five specific issues with our fees. We haven’t increased any of our fees in nearly two decades, and while we’re still okay financially and do not have a revenue growth goal, we do have inclusion and simplification goals. This report from Research Consulting helped to narrow down the five priority projects for 2024-2025 around these three core goals:
To work out which version you’re on, take a look at the website address that you use to access iThenticate. If you go to ithenticate.com then you are using v1. If you use a bespoke URL, https://crossref-[your member ID].turnitin.com/ then you are using v2.
The Settings tab controls general, document, and report display options. These options include the number of documents shown for each page, default report view, and controlling email notifications.
General settings (v1)
Use General settings to set your home folder - this is the folder will open by default when you log in to iThenticate. Choose your home folder from the drop-down menu.
From the Number of documents to show drop-down, choose how many uploaded documents are listed in your folders before a new page is created.
Choose what is displayed after you upload a document to iThenticate: Display the upload folder (to see the processing of the document you have just uploaded), or Upload another document (returns you to the upload form).
You can also choose the time zone and language for your account - the language you choose will set the language of your user interface.
Click Update Settings to save your changes.
Documents settings (v1)
Use Documents settings to choose the default way iThenticate sorts your uploaded documents: by processed date, title, Similarity Score, and author. Choose your preferred option from the drop-down menu.
You can set the threshold at which the Similarity Score color changes, based on the percentage of similarity. All Similarity Scores above the percentage you set will appear in the folder in blue, all those beneath the percentage will appear in gray. This visual distinction helps you easily identify matches above a given threshold. Learn more about how to interpret the Similarity Score.
Click Update Settings to save your changes.
Reports settings (v1)
Use Reports settings to adjust your email notifications, choose whether to color-code your reports, and view available document repositories for your account.
Email notifications tell you when a Similarity Report has exceeded particular thresholds, including Similarity Reports in shared folders. Email notifications are sent to the email address you used to sign up to iThenticate.
Report email frequency: choose whether to receive notifications, chose how often you would like to receive them every hour, once a day, every other day, or once a week
Similarity Report threshold: this refers to a paper’s overall Similarity Score. If the Similarity Score of a paper in your account exceeds the threshold set, you will receive an email notification. The default setting is ‘don’t notify me’.
Content tracking report threshold: this refers to the All Sources section of the Similarity Report. If a single source for a paper in your account exceeds the similarity threshold set, you will receive an email notification. The default setting is don’t notify me.
Color code report: color-coding the Similarity Report can make viewing matches easier. Choose Yes or No to enable or disable this feature.
Available document repositories: this section shows the available repositories for your account. Modify them in the folder settings.
Page owner: Kathleen Luschek | Last updated 2020-May-19