ATTENTION: The package is not working as of 2017-11-26. The Lattes website, where the XML files were available, is offline.
Lattes is a unique platform for academic curricula and the largest of its kind. There you can find information about the academic work of all Brazilian scholars. It includes the institution where the PhD was obtained, current employer, field of work, metadata for all publications, and more. It is a unique and reliable source of information for bibliometric studies.
I’ve been working with Lattes data for some time. Here I present a short list of papers that have used this data.
The Brazilian scientific output published in journals: A study based on a large CV database
Análise do Perfil dos Acadêmicos e de suas Publicações Científicas em Administração (in Portuguese)
Is predatory publishing a real threat? Evidence from a large database study. Working paper
Package GetLattesData
GetLattesData is a collection of functions I’ve been using to access the dataset. Its main innovation is the ability to download data directly from Lattes, without any manual work or captcha solving.
The package is available on CRAN:
install.packages('GetLattesData')
You can also install the development version from GitHub:
#install.packages('devtools')
devtools::install_github('msperlin/GetLattesData')
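If the installation step is part of a script that runs repeatedly, it may be convenient to install only when the package is missing. The following is a minimal sketch using base R only; the helper name `install_if_missing` is hypothetical and not part of GetLattesData.

```r
# Hypothetical helper (not part of GetLattesData): install a CRAN package
# only when it is not already available in the current library.
install_if_missing <- function(pkg) {
  if (!requireNamespace(pkg, quietly = TRUE)) {
    install.packages(pkg)
  }
  # Report (invisibly) whether the package can now be loaded
  invisible(requireNamespace(pkg, quietly = TRUE))
}

# Uncomment to run; this needs a configured CRAN mirror and network access:
# install_if_missing('GetLattesData')
```

The function returns `TRUE` invisibly when the package is available afterwards, so a script can stop early if the installation did not succeed.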
See the package vignette for more examples.
library(GetLattesData)
# ids from EA-UFRGS
my.ids <- c('K4713546D3', 'K4440252H7',
'K4783858A0', 'K4723925J2')
# qualis for the field of management
field.qualis <- 'ADMINISTRAÇÃO PÚBLICA E DE EMPRESAS, CIÊNCIAS CONTÁBEIS E TURISMO'
l.out <- gld_get_lattes_data(id.vec = my.ids, field.qualis = field.qualis)
tpublic <- l.out$tpublic
dplyr::glimpse(tpublic)
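Since the download depends on the Lattes website being reachable, a script may want to fail gracefully when the request errors out. Below is a minimal sketch that wraps any fetching function in `tryCatch`; the wrapper name `safe_fetch` is hypothetical, and the sketch assumes that the underlying function signals an ordinary R error when the download fails.

```r
# Hypothetical wrapper (not part of GetLattesData): call a function and
# return NULL with a message instead of stopping the script on error.
safe_fetch <- function(fetch_fun, ...) {
  tryCatch(
    fetch_fun(...),
    error = function(e) {
      message('Download failed: ', conditionMessage(e))
      NULL
    }
  )
}

# Self-contained demonstration with a stand-in function that always fails
fake_download <- function(id.vec) stop('website is offline')
res <- safe_fetch(fake_download, id.vec = c('K4713546D3'))
is.null(res)

# With the package installed, the call from the example could be wrapped as:
# l.out <- safe_fetch(GetLattesData::gld_get_lattes_data,
#                     id.vec = my.ids, field.qualis = field.qualis)
```

Checking `is.null(l.out)` before accessing `l.out$tpublic` then avoids a second, less informative error further down the script.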