Project Status - DDMAL/linkedmusic-datalake GitHub Wiki

Completed Work

A visualization of the current complete LinkedMusic data lake ontology can be found here.

As of 20 August 2025, we have finished ingesting the following databases to the LinkedMusic data lake, a total of 383,809,033 RDF triples:

Note: As our workflow becomes more streamlined, these databases would benefit from additional passes of the full ingestion pipeline

Datasets in-progress

Datasets to ingest in the near future

Additional tasks

  1. Develop the NLQ2SPARQL tool (SESEMMI)
  1. Upload new entities and properties to Wikidata
  • Refer to Wikidata Uploading (Feast Day Project) for our current approach to uploading Wikidata, starting with Feasts
  • Refer to Wikidata: Things we should add for other categories of items that would be useful for our purposes if they were on Wikidata

    Note: Dataset-specific documentation may have a partial list of specific items missing from Wikidata

  1. Develop the front end for LinkedMusic