Požega, Marko and Poljak, Dario and Kocijan, Kristina.
(2016).
Building Scholarly Data Forest.
In: Semantics, Analytics, Visualization. Enhancing Scholarly Data. SAVE-SD 2016, 11.04.2016., Montreal, Canada.
Abstract
In this paper, we will demonstrate syntactic analysis and visualization of scientific data, namely references from scientific papers. Our main goal is to build a parser which could extract references from scientific papers, convert them to XML format, send to custom visualization algorithm and present in a web interface as a ReferenceTree for a single author. For this process, we use several different technologies such as NLP software NooJ, programming languages PHP and JavaScript in combination with HTML5. Our main problem was dissimilarity in reference styles between articles. Thus, our parser was designed to recognize different reference source (book, paper, web page) in APA, MLA and Chicago reference styles. As for the visualization idea, we have chosen the concept of presenting an author as a tree, the publication years as the main branches, the articles/books as twigs and references used in each article/book as the leaves. The books are grouped on the left side of the tree while the articles are grouped on the right side. With final output, every processed author should have a unique tree (preferences of references) and could be compared with the rest of the scientific forest.
Item Type: |
Published conference work
(Lecture)
|
Related URLs: |
|
Uncontrolled Keywords: |
scholarly data, network visualization, contact trees, egocentric networks, NLP, APA, MLA Chicago reference style, science mapping, ReferenceTree |
Subjects: |
Information sciences |
Departments: |
Department of Information Science |
Date Deposited: |
02 Jun 2017 10:50 |
Last Modified: |
31 Dec 2018 00:15 |
URI: |
http://darhiv.ffzg.unizg.hr/id/eprint/8838 |
Actions (login required)
|
View Item |