352 Research ‐ Domain Analysis - bounswe/bounswe2024group1 GitHub Wiki

Domain Analysis

The process of analyzing related software systems in a domain to find their common and variable parts

Overview

We must know about the tools we will be using along the way, so this page is created as a result of comprehensive research. Any team member can consult this page whenever they need, and add new information as they find missing parts.

Concepts

Wikidata

Wikidata is a free, collaborative, multilingual, secondary knowledge base, collecting structured data to provide support for Wikipedia, Wikimedia Commons, the other wikis of the Wikimedia movement, and to anyone in the world. The Wikidata repository consists mainly of items, each one having a label, a description and any number of aliases. Items are uniquely identified by a Q followed by a number, such as Cuisine (Q1778821).

All the data can be copied and used without any requirement for permission.
Data is entered and maintained by many Wikidata editors
Data entered in any language is immediately available in all other languages, editing in any language is possible.
Data is highly structured and organized, making it possible for computers and programs to "understand" it.
We are mostly using Reasonator to search for Wikidata items.

Linked Data

Structured data which is interlinked with other data so it becomes more useful through semantic queries. It is not only for human users, the structure of the data can be read automatically by computers. The main principles are:

Uniform Resource Identifiers (URIs) should be used to name and identify individual things.
HTTP URIs should be used to allow these things to be looked up, interpreted, and subsequently "dereferenced".
Useful information about what a name identifies should be provided through open standards such as RDF, SPARQL, etc.
When publishing data on the Web, other things should be referred to using their HTTP URI-based names.

Uniform Resource Identifiers

A unique sequence of characters that identifies an abstract or physical resource, such as resources on a webpage, mail address, phone number, books, real-world objects such as people and places, or concepts. Examples:

International Standard Book Number (ISBN) system, ISBN 0-486-27557-4 identifies a specific edition of Shakespeare's play Romeo and Juliet.
Uniform Resource Locator (URL), the URL http://example.org/wiki/Main_Page

SPARQL Semantic Queries

An RDF query language, that is, a semantic query language for databases. Wikidata provides an SPARQL endpoint including a powerful Web-GUI. With SPARQL you can extract any kind of data, with a query composed of logical combinations of triples. SPARQL query service GUI is here

Sources

Wikidata
Linked Data
URI
SPARQL
So You Think You Know Git - an excellent talk by a cocreator of GitHub