Redis Services Interface - Sotera/watchman GitHub Wiki
Purpose: describes input/output formats for all Redis-based services.
Input
- state == "new"
- job_id == id of the job being run
- other service-specific attrs
Output
- attributes: state, data
- state == "processed" when complete
- state == "error" when error
- error == error msg if there is an error. otherwise nil or not present.
- data: an array of feature values, or null if error. Set an empty array to represent processed but no results
Cosine Similarity specific input
- query_url == the endpoint to query for data/scrolling
- result_url == the endpoint to post a result
- similarity_threshold
- similarity_method == custom or falconn
- start_time_ms == timestamp in millis of the window start time
- end_time_ms == timestamp in millis of the window end time
- lang == en ..?
- data_type == text or image
Image Feature Extraction specific input
- urls == a stringified json object with a urls property which is equal to the image_urls field in the social media post object
Redis channels
N.B. prefix channel names with 'genie:'
###Featurize Creation
- Text Syntax - feature_txt
- Images - feature_img
###Cluster Creation
- Text Syntax Clustering - clust_txt
- Hashtag Clustering - clust_hash
- Image Clustering - clust_img
Testing
- job should fail when service-specific attrs are missing and provide error details.
- job should fail if internal network calls, file ops, or any I/O fails.
TODO
- create system-wide unique redis keys.
- decide on a consistent system-testing strategy for all service developers.