Package 'doilinker'

Title: Link Preprints And Publications By DOI
Description: Links preprints to publications using the method described in Cabanac G, Oikonomidi T, Boutron I. "Day-to-day discovery of preprint-publication links". Scientometrics. 2021;1–20. DOI: 10.1007/s11192-021-03900-7.
Authors: Luke Zappia [aut, cre] (<https://orcid.org/0000-0001-7744-8565>, lazappi)
Maintainer: Luke Zappia
License: GPL (>= 3)
Version: 0.1.1
Built: 2024-08-24 04:15:28 UTC
Source: https://github.com/lazappi/doilinker

Help Index


Calculate Jaccard similarity

Description

Calculate the Jaccard similarity between two strings. Each string is first tokenised, stop words are removed, and the remaining tokens are stemmed before the two token sets are compared.

Usage

calc_jaccard_similarity(string1, string2)

Arguments

string1

First string to compare

string2

Second string to compare

Value

Jaccard similarity score
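
The Jaccard similarity of two token sets is the size of their intersection divided by the size of their union. The following is a minimal base R sketch of that idea, not the package's implementation: the names `stop_words`, `tokenise` and `jaccard_sketch` are hypothetical, the stop word list is a small illustrative one, and the stemming step is left out because base R has no stemmer.

```r
# Hypothetical stop word list; the list used by the package is not shown here
stop_words <- c("a", "an", "and", "by", "for", "in", "of", "on", "the", "to", "with")

tokenise <- function(string) {
  # Lower-case, then split on anything that is not a letter or digit
  tokens <- strsplit(tolower(string), "[^a-z0-9]+")[[1]]
  tokens <- tokens[nzchar(tokens)]
  # setdiff() removes stop words and also drops duplicate tokens
  setdiff(tokens, stop_words)
}

jaccard_sketch <- function(string1, string2) {
  tokens1 <- tokenise(string1)
  tokens2 <- tokenise(string2)
  n_union <- length(union(tokens1, tokens2))
  # Two strings with no usable tokens score 0 here (an assumed convention)
  if (n_union == 0) {
    return(0)
  }
  length(intersect(tokens1, tokens2)) / n_union
}

jaccard_sketch(
  "Day-to-day discovery of preprint-publication links",
  "Discovery of links between preprints and publications"
)
# 0.25
```

Only "discovery" and "links" are shared among the eight distinct tokens, giving 2 / 8. With stemming, "preprint" and "preprints" (and "publication" and "publications") would reduce to the same token and the score would be higher.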


Get Crossref authors string

Description

Convert the authors table from Crossref into a single string of the form "LastName1, FirstName1; LastName2, FirstName2;...".

Usage

get_cr_authors_str(cr_authors)

Arguments

cr_authors

Authors data.frame from a Crossref query

Value

Character vector with authors' names
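
As an illustration of the output format, here is a base R sketch that builds such a string. It assumes the authors table has `family` and `given` columns; those column names, the example data and the function name `authors_str_sketch` are assumptions, not taken from the package.

```r
# Hypothetical authors table with assumed `given` and `family` columns
cr_authors <- data.frame(
  given = c("Jane", "John"),
  family = c("Doe", "Smith"),
  stringsAsFactors = FALSE
)

authors_str_sketch <- function(cr_authors) {
  # Format each row as "LastName, FirstName"
  author_names <- paste(cr_authors$family, cr_authors$given, sep = ", ")
  # Join the rows into one string separated by "; "
  paste(author_names, collapse = "; ")
}

authors_str_sketch(cr_authors)
# "Doe, Jane; Smith, John"
```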


Get first author ORCiD

Description

Return the ORCiD for the first author of a reference.

Usage

get_first_orcid(cr_authors)

Arguments

cr_authors

Authors data.frame from a Crossref query

Value

Character vector with the ORCiD, or NA if not found
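
A rough sketch of this lookup in base R is shown below. It assumes the authors table stores identifiers in an `ORCID` column with the first author in the first row, and it strips a leading `orcid.org` URL; the column name, the URL stripping, the example identifier and the name `first_orcid_sketch` are all assumptions rather than documented behaviour.

```r
first_orcid_sketch <- function(cr_authors) {
  # Without an ORCID column, or without any authors, there is nothing to return
  if (!"ORCID" %in% names(cr_authors) || nrow(cr_authors) == 0) {
    return(NA_character_)
  }
  # Assume the first row is the first author
  orcid <- cr_authors$ORCID[1]
  if (is.na(orcid) || !nzchar(orcid)) {
    return(NA_character_)
  }
  # Keep only the identifier if the value is given as a URL
  sub("^https?://orcid\\.org/", "", orcid)
}

# Hypothetical authors table with a placeholder identifier
cr_authors <- data.frame(
  given = c("Jane", "John"),
  family = c("Doe", "Smith"),
  ORCID = c("http://orcid.org/0000-0000-0000-0000", NA),
  stringsAsFactors = FALSE
)

first_orcid_sketch(cr_authors)
# "0000-0000-0000-0000"
```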


Is DOI preprint

Description

Check whether a DOI is from a preprint server.

Usage

is_doi_preprint(doi, container_title)

Arguments

doi

DOI to check

container_title

Whether or not there is a container title (journal name) associated with the DOI

Value

Logical indicating whether the DOI is from a preprint


Is result a match

Description

Decide whether a result from a DOI query is a match.

Usage

is_result_match(
  query_title,
  result_title,
  query_orcid,
  result_orcid,
  query_authors,
  result_authors
)

Arguments

query_title

Title of the query reference

result_title

Title of the result reference

query_orcid

ORCiD of the query first author

result_orcid

ORCiD of the result first author

query_authors

Authors string for the query

result_authors

Authors string for the result

Details

A result is a match if the similarity between the titles is sufficiently high, or if the similarity is lower but either the first author ORCiDs match or the first author names match.

Value

Logical indicating whether the result is a match
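
The decision rule in Details can be sketched in base R as follows. This is an illustration only: the function name `result_match_sketch`, the threshold values `high` and `low`, and the use of a lower cut-off below which author evidence is ignored are assumptions, since the thresholds are not documented here. The title similarity and the first author comparison are passed in as precomputed values to keep the sketch self-contained.

```r
result_match_sketch <- function(title_similarity, query_orcid, result_orcid,
                                same_first_author, high = 0.8, low = 0.5) {
  # A sufficiently similar title is accepted on its own
  if (title_similarity >= high) {
    return(TRUE)
  }
  # Assumed floor: very dissimilar titles are rejected outright
  if (title_similarity < low) {
    return(FALSE)
  }
  # In between, require supporting evidence from the first author
  orcids_match <- !is.na(query_orcid) && !is.na(result_orcid) &&
    query_orcid == result_orcid
  orcids_match || isTRUE(same_first_author)
}

# High title similarity alone is enough
result_match_sketch(0.9, NA, NA, FALSE)
# Moderate similarity needs a matching ORCiD or first author
result_match_sketch(0.6, "0000-0000-0000-0000", "0000-0000-0000-0000", FALSE)
result_match_sketch(0.6, NA, NA, FALSE)
```

Missing ORCiDs are treated as "no evidence" rather than as a mismatch, so a result can still be matched on first author names.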


Same first author

Description

Check if two author strings have the same first author.

Usage

same_first_author(authors_str1, authors_str2)

Arguments

authors_str1

Character vector containing the first list of authors

authors_str2

Character vector containing the second list of authors

Details

Last names of the first authors are compared. If these match, the first names are checked, and TRUE is returned if either the whole first name or the first initial matches. If first and last names cannot be separated, the whole author names are compared. Some simplification of characters in names is performed to improve matches.

Value

Logical indicating whether the first authors are the same
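
The comparison described in Details can be sketched in base R as below. It assumes author strings of the form "LastName1, FirstName1; LastName2, FirstName2". The helper names (`simplify_name`, `split_first_author`, `same_first_author_sketch`) are hypothetical, and the simplification step here only lower-cases names and drops punctuation and spaces, which is a stand-in for whatever the package actually does.

```r
simplify_name <- function(name) {
  # Lower-case and drop punctuation and spaces (assumed simplification)
  gsub("[[:punct:][:space:]]+", "", tolower(name))
}

split_first_author <- function(authors_str) {
  # The first author is everything before the first ";"
  first <- trimws(strsplit(authors_str, ";", fixed = TRUE)[[1]][1])
  # Split "LastName, FirstName" at the comma
  parts <- trimws(strsplit(first, ",", fixed = TRUE)[[1]])
  if (length(parts) < 2) {
    # Names that cannot be separated are kept whole
    return(list(whole = simplify_name(first), last = NA, first = NA))
  }
  list(
    whole = simplify_name(first),
    last = simplify_name(parts[1]),
    first = simplify_name(parts[2])
  )
}

same_first_author_sketch <- function(authors_str1, authors_str2) {
  a1 <- split_first_author(authors_str1)
  a2 <- split_first_author(authors_str2)
  # Fall back to whole names when either name cannot be separated
  if (is.na(a1$last) || is.na(a2$last)) {
    return(a1$whole == a2$whole)
  }
  if (a1$last != a2$last) {
    return(FALSE)
  }
  # Accept a full first-name match or a matching first initial
  a1$first == a2$first ||
    substr(a1$first, 1, 1) == substr(a2$first, 1, 1)
}

same_first_author_sketch("Doe, Jane; Smith, John", "Doe, J.; Smith, J.")
# TRUE
same_first_author_sketch("Doe, Jane", "Roe, Jane")
# FALSE
```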