SEMAGROW engine

Website: http://semagrow.github.io/
Supported versions: 1.4.1
Current responsible(s): Yiannis Mouchakis @ NCSR-D -- [email protected]
Docker image(s): semagrow/semagrow
More info: https://github.com/semagrow/semagrow

Short description

Semagrow is a query federation engine that is both algorithmically sophisticated and well engineered. It has been shown to outperform all other federated querying engines on both standard benchmarks and our own use cases.

Semagrow allows combining, cross-indexing and, in general, making the most of all public data, regardless of its size, update rate, and schema. It offers a single SPARQL endpoint that serves data from remote data sources and hides heterogeneity from client applications, both in form (by federating non-SPARQL endpoints) and in meaning (by transparently mapping queries and query results between vocabularies).

The Semagrow query planner exploits metadata about the nodes of the federation to optimize query execution. Semagrow allows full flexibility in the level of detail of this metadata and exhibits pay-as-you-go behaviour: it remains robust when the metadata is sparse or inaccurate, and the quality of the optimization improves as the metadata becomes more detailed and accurate.

Example usage

To run Semagrow, issue:

docker run -p <host_port>:8080 semagrow/semagrow

You can then access it at:

http://localhost:<host_port>/SemaGrow/

Inside the container, the Semagrow configuration files are located at:

/etc/default/semagrow

To use external configuration files, mount a local directory over that path when running Semagrow:

docker run -p <host_port>:8080 -v <local_dir>:/etc/default/semagrow semagrow/semagrow
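
As a concrete illustration of the commands above, the following is a minimal sketch with the placeholders filled in. The port 8080, the directory name semagrow-config, the DRY_RUN switch and the run helper are assumptions made for this example and are not part of Semagrow. By default the script only prints the commands so they can be reviewed first.

```shell
#!/bin/sh
# Hypothetical values for the placeholders above; adjust to your setup.
HOST_PORT="${HOST_PORT:-8080}"                    # stands in for <host_port>
CONFIG_DIR="${CONFIG_DIR:-$PWD/semagrow-config}"  # stands in for <local_dir>
DRY_RUN="${DRY_RUN:-1}"                           # set to 0 to really execute

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# Start Semagrow in the background with the local directory mounted
# over the container's configuration path.
run docker run -d -p "${HOST_PORT}:8080" \
  -v "${CONFIG_DIR}:/etc/default/semagrow" \
  semagrow/semagrow

# Check that the web application answers (startup may take a few seconds).
run curl -I "http://localhost:${HOST_PORT}/SemaGrow/"
```

Note that a bind mount hides whatever the image already has at /etc/default/semagrow, so the local directory should contain the full set of configuration files Semagrow needs rather than only the ones you changed.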

Scaling

To add a new node to the Semagrow federation, follow the instructions at http://semagrow.github.io/configuration/
