Topic recommendation using Doc2Vec
2018 (English)In: 2018 International Joint Conference on Neural Networks (IJCNN), Institute of Electrical and Electronics Engineers (IEEE) , 2018Conference paper, Published paper (Refereed)
Abstract [en]
The ever-increasing number of electronic content stored in digital libraries requires a significant amount of effort in cataloguing and has led to self-deposit solutions where the authors submit and publish their own digital records. Even in self-deposit, going through the abstract and assigning subject terms or keywords is a time consuming and expensive process, yet crucial for the metadata quality of the record that affects retrieval. Therefore, an automatic, or even a semi-automatic process that can recommend topics for a new entry is of huge practical value. A system that can address that has to rely basically on two components, one component for efficiently representing the relevant information of the new document and one component for recommending an appropriate set of topics based on the representation of the previous stage. In this work, different candidate solutions for both components are investigated and compared. For the first stage both distributed Document to Vector (doc2vec) and conventional Bag of Words (BoW) components are employed, while for the latter two different transformation approaches from the field of multi-label classification are compared. For the comparison, a collection of Ph.D. abstracts (~19000 documents) from the MIT Libraries Dspace repository is used suggesting that different combinations can provide high quality solutions.
Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2018.
Keywords [en]
Recommender system, multilabel classification, word2vec, doc2vec, bag of words
National Category
Computer Sciences Information Studies
Research subject
Control Engineering
Identifiers
URN: urn:nbn:se:ltu:diva-71543DOI: 10.1109/IJCNN.2018.8489513ISI: 000585967404126Scopus ID: 2-s2.0-85056491792ISBN: 978-1-5090-6014-6 (electronic)OAI: oai:DiVA.org:ltu-71543DiVA, id: diva2:1262483
Conference
2018 International Joint Conference on Neural Networks (IJCNN), Rio de Janeiro, Brazil, 8-13 July, 2018
2018-11-122018-11-122026-02-13Bibliographically approved