PRAGMA ENT Data Node connection test and information retrieval - pragmagrid/pragma_ent GitHub Wiki

PRAGMA-ENT Data Node connection test and information retrieval

IU Node, one site of PRAGMA-ENT mesh, has hosted a persistent data store which makes available to use. The store contains digitalized books (OCR) from university research libraries. Current IU data store has open Hathitrust corpus which contains 679 volumes, 154,700 OCI digitalized book pages. The average time to retrieve one volume from 10 runs is 13.797 ms that is optimized through replication set strategy. For access control and data security, IU node uses Linux iptables firewall to control incoming and outgoing traffic from certain IP address on mongoDB port 27017. We also enable access control to mongoDB instance by requiring that users identify themselves using username and password pair.

A general architecture graph is as below:

alt tag

Prerequirements

Install MongoDB on your test node; Please refer to https://docs.mongodb.org/manual/tutorial/install-mongodb-on-red-hat/

Email IU Data Node admin with your IP address (public IP and ent IP).

Connection Test and Information Retrieval

1/ Write a simple bash script query to retrieve one volume from corpus

> vi script.js

Add the below 2 lines:

use openhathitrust
db.volumerecords.find({"volumename":"nnc2.2187924.0001.001"})

2/ Execute this simple query script using both public access and ent access (1) Public network access

>mongo pragma8.cs.indiana.edu:27017/openhathitrust -u pragmaent -p pragma8iu < script.js

(2) ENT network access

>mongo 10.100.4.1:27017/openhathitrust -u pragmaent -p pragma8iu < script.js

Note: The user “pragmaent” only has READ permission of openhathitrust corpus. Please contact IU data node admin if you require more permission.

Release History

  • 0.0.1 Initial release

Contact Info