TY - GEN
T1 - LD Connect: A Linked Data Portal for IOS Press Scientometrics
AU - Liu, Zilong
AU - Shi, Meilin
AU - Janowicz, Krzysztof
AU - Regalia, Blake
AU - Delbecque, Stephanie
AU - Mai, Gengchen
AU - Zhu, Rui
AU - Hitzler, Pascal
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - In this work, we describe a Linked Data portal, LD Connect, which operates on all bibliographic data produced by IOS Press over the past thirty-five years, including more than a hundred thousand papers, authors, affiliations, keywords, and so forth. However, LD Connect is more than just an RDF-based metadata set of bibliographic records. For example, all affiliations are georeferenced, and co-reference resolution has been performed on organizations and contributors including both authors and editors. The resulting knowledge graph serves as a public dataset, web portal, and query endpoint, and it acts as a data backbone for IOS Press and various bibliographic analytics. In addition to the metadata, LD Connect is also the first portal of its kind that publicly shares document embeddings computed from the full text of all papers and knowledge graph embeddings based on the graph structure, thereby enabling semantic search and automated IOS Press scientometrics. These scientometrics run directly on top of the graph and combine it with the learned embeddings to automatically generate data visualizations, such as author and paper similarity over all journals. By making the involved ontologies, embeddings, and scientometrics all publicly available, we aim to share LD Connect services with not only the Semantic Web community but also the broader public to facilitate research and applications based on this large-scale academic knowledge graph. Particularly, the presented scientometric system generalizes beyond IOS Press data and can be deployed on top of other bibliographic datasets as well.
AB - In this work, we describe a Linked Data portal, LD Connect, which operates on all bibliographic data produced by IOS Press over the past thirty-five years, including more than a hundred thousand papers, authors, affiliations, keywords, and so forth. However, LD Connect is more than just an RDF-based metadata set of bibliographic records. For example, all affiliations are georeferenced, and co-reference resolution has been performed on organizations and contributors including both authors and editors. The resulting knowledge graph serves as a public dataset, web portal, and query endpoint, and it acts as a data backbone for IOS Press and various bibliographic analytics. In addition to the metadata, LD Connect is also the first portal of its kind that publicly shares document embeddings computed from the full text of all papers and knowledge graph embeddings based on the graph structure, thereby enabling semantic search and automated IOS Press scientometrics. These scientometrics run directly on top of the graph and combine it with the learned embeddings to automatically generate data visualizations, such as author and paper similarity over all journals. By making the involved ontologies, embeddings, and scientometrics all publicly available, we aim to share LD Connect services with not only the Semantic Web community but also the broader public to facilitate research and applications based on this large-scale academic knowledge graph. Particularly, the presented scientometric system generalizes beyond IOS Press data and can be deployed on top of other bibliographic datasets as well.
KW - Document embeddings
KW - Knowledge graph embeddings
KW - Knowledge graphs
KW - LD Connect
KW - Ontology engineering
KW - Scientometrics
UR - https://www.scopus.com/pages/publications/85131955539
U2 - 10.1007/978-3-031-06981-9_19
DO - 10.1007/978-3-031-06981-9_19
M3 - Contribution to proceedings
SN - 978-3-031-06980-2
T3 - Lecture Notes in Computer Science
SP - 323
EP - 337
BT - The Semantic Web - 19th International Conference, ESWC 2022, Proceedings
A2 - Groth, Paul
A2 - Vidal, Maria-Esther
A2 - Suchanek, Fabian
A2 - Szekley, Pedro
A2 - Kapanipathi, Pavan
A2 - Pesquita, Catia
A2 - Skaf-Molli, Hala
A2 - Tamper, Minna
PB - Springer
CY - Cham
T2 - 19th European Semantic Web Conference (ESWC 2022)
Y2 - 29 May 2022 through 2 June 2022
ER -