Masters Final Oral: Dipit Malhotra

Monday, June 12, 2017 - 1:00pm
223 Atanasoff Hall

Title: Web Based Application for Extracting, Storing, and Analyzing Research Publication Data
Date/Time: June 12th, 2017 @ 1:00 PM
Place: 223 Atanasoff Hall 
Major Professor: Carl Chang
Committee Members: Samik Basu and Simanta Mitra


Information about research publications is available all over the internet. Search engines such as Google Scholar, ACM, and Google Books allow users to search for information about published research articles. However, there is no free, centralized platform that allows a user to run many queries a day: each of the platforms mentioned above limits the number of queries allowed per user. There is a need for a platform that addresses this issue and also provides the user with accurate information. To accomplish this, we need a massive database of the scholarly material available on the internet. Such a database can be built by scraping the citations from research paper PDFs and storing the information related to each citation in JSON BibTeX. For every PDF that the user studies, the citations mentioned in the PDF are exported to the database, together with their respective information extracted from available APIs. We provide a framework that automates the process of importing citation data into a database. Lastly, we also developed REST APIs that are useful for retrieving information about scholarly material published with a DOI and that can serve as a good resource for building third-party applications.
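
The citation-import step described above can be illustrated with a minimal sketch. It is not the framework presented in the talk: it assumes the reference text has already been extracted from a PDF, uses a loose regular expression to find DOIs, and builds BibTeX-style JSON records. The names extract_dois and to_record, the record fields, and the sample DOIs are hypothetical placeholders; a real pipeline would fill in the metadata from an external API.

```python
import json
import re

# Loose pattern for DOIs: the "10." prefix, a registrant code, "/", then a suffix.
DOI_PATTERN = re.compile(r'\b10\.\d{4,9}/[^\s"<>]+')


def extract_dois(text):
    """Return the unique DOIs found in text, in order of first appearance."""
    seen = []
    for match in DOI_PATTERN.findall(text):
        doi = match.rstrip(".,;)")  # drop trailing sentence punctuation
        if doi not in seen:
            seen.append(doi)
    return seen


def to_record(doi, metadata=None):
    """Build a BibTeX-style, JSON-serializable record for one citation.

    The field names are assumptions for illustration only.
    """
    record = {"type": "article", "doi": doi}
    record.update(metadata or {})
    return record


# Placeholder reference text standing in for citations scraped from a PDF.
sample = (
    "[1] A. Author, Example paper, doi:10.1000/xyz123.\n"
    "[2] B. Author, Another paper. https://doi.org/10.1000/abc456\n"
)

records = [to_record(d) for d in extract_dois(sample)]
print(json.dumps(records, indent=2))
```

Each record could then be written to a database and served by DOI through a REST endpoint; those parts are omitted here because the announcement does not describe their design.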