Metodo

International Studies in Phenomenology and Philosophy

Semantic author name disambiguation with word embeddings

Mark-Christoph Müller

pp. 300-311

Abstract

We present a supervised machine learning system for author name disambiguation (AND) that tackles semantic similarity between publication titles by means of word embeddings. Word embeddings are integrated as external components, which keeps the model small and efficient while allowing for easy extensibility and domain adaptation. Initial experiments show that word embeddings can improve the recall and F-score of the binary classification sub-task of AND. Results for the clustering sub-task are less clear but also promising, and overall they show the feasibility of the approach.
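
To make the idea of embedding-based title similarity concrete, the following is a minimal sketch and not the chapter's actual system. It assumes that a title is represented by the average of its word vectors and that two titles are compared by cosine similarity; the abstract does not specify either choice. The vocabulary `TOY_EMBEDDINGS`, its three-dimensional vectors, and the function names are all invented for illustration. Keeping the embeddings in a separate lookup table mirrors the abstract's point that they are an external, swappable component.

```python
import math

# Toy 3-dimensional word vectors, invented for illustration only.
# A real system would load pretrained embeddings from an external resource.
TOY_EMBEDDINGS = {
    "neural": [0.90, 0.10, 0.00],
    "network": [0.80, 0.20, 0.10],
    "deep": [0.85, 0.15, 0.05],
    "learning": [0.70, 0.30, 0.10],
    "medieval": [0.00, 0.10, 0.90],
    "poetry": [0.10, 0.00, 0.95],
}


def title_vector(title, embeddings):
    """Average the vectors of all in-vocabulary words (assumed representation).

    Returns None when no word of the title is in the vocabulary.
    """
    vectors = [embeddings[w] for w in title.lower().split() if w in embeddings]
    if not vectors:
        return None
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def title_similarity(title_a, title_b, embeddings=TOY_EMBEDDINGS):
    """Similarity of two titles; 0.0 when either title has no known words."""
    u = title_vector(title_a, embeddings)
    v = title_vector(title_b, embeddings)
    if u is None or v is None:
        return 0.0
    return cosine(u, v)


if __name__ == "__main__":
    # Titles with no words in common can still score as related.
    print(title_similarity("Deep learning", "Neural network"))
    print(title_similarity("Deep learning", "Medieval poetry"))
```

In a pairwise classifier of the kind the abstract describes, a score like this could serve as one feature for deciding whether two publications belong to the same author; how the chapter actually combines it with other features is not stated here.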

Publication details

Published in:

Kamps Jaap, Tsakonas Giannis, Manolopoulos Yannis, Iliadis Lazaros, Karydis Ioannis (2017) Research and advanced technology for digital libraries: 21st international conference on theory and practice of digital libraries, TPDL 2017, Thessaloniki, Greece, September 18-21, 2017. Dordrecht, Springer.

Pages: 300-311

DOI: 10.1007/978-3-319-67008-9_24

Full citation:

Müller Mark-Christoph (2017) “Semantic author name disambiguation with word embeddings”, In: J. Kamps, G. Tsakonas, Y. Manolopoulos, L. Iliadis & I. Karydis (eds.), Research and advanced technology for digital libraries, Dordrecht, Springer, 300–311.