Home - wtepfenhart/Link-Analysis GitHub Wiki
Welcome to the Link Analysis project. It's a simple little project to extract information from Wikipedia. In fact, the intent is to extract who, what, when, where, why, and how information from it - placing people at places and events. One goal is to create a world map with a date control to give a view of world events in terms of what was happening and where it happened, such as having a little pin showing on a map of the world which lets one know that George Washington gave his Second Inaugural Speech at this location on this particular date. An alternative use to to identify individuals who exist in a single degree of separation from one another. Maybe one wants to know the answer to the question, did Benjamin Franklin know Marie Antoinette?
We've had a number of students contribute to this project over the years. Some have contributed more than others, but the contributions of all are appreciated. More important to me is that they learn something while working on it. Keep in mind that the school year straddles two years with the Fall and Spring semesters falling in separate years, so I don't really have that many people working on the project at given time. One or two in fall and one or two in the spring.
Wall of Fame
2016 Contributors and Contributions
Contributors
- Yi Wang (MS CS 2016)
- Mary Menges (MS SE 2017)
Contributions
- Yi wrote the element scraper that extracts the content of a Wikipage into individual paragraphs for later processing.
- Mary helped set up some of the very earliest work on this project including writing the scripts for passing the outputs of the element scraper into Stanford CoreNLP.
2017 Contributors and Contributions
Contributors
- Mary Menges (MS SE 2017)
- Michael Sagan (MS SE 2018)
Contributions
- Mary wrote the program that encapsulates the 6 degrees of freedom to expand the query space from a given person to all those who are mentioned in conjunction with them.
2018 Contributors and Contributions
Contributors
- Todd Qualiano (MS SE 2019)
- KerryAnn DeMeester (MS SE 2019)
- Micheal Sagan (MS SE 2018)
Contributions
- Todd wrote a program to transform the output of the Stanford Core NLP from XML to CSV and has begun to reorganize all of the myriad bits and pieces of code that have generated over the years.
- KerryAnn is working on getting the data extracted from Wikipedia into the graph database, Neo4j.