Redis Services Interface - Sotera/watchman GitHub Wiki

Purpose: describes input/output formats for all Redis-based services.

Input

  1. state == "new"
  2. job_id == id of the job being run
  3. other service-specific attrs

Output

  1. attributes: state, data
  2. state == "processed" when complete
  3. state == "error" when error
  4. error == error msg if there is an error. otherwise nil or not present.
  5. data: an array of feature values, or null if error. Set an empty array to represent processed but no results

Cosine Similarity specific input

  1. query_url == the endpoint to query for data/scrolling
  2. result_url == the endpoint to post a result
  3. similarity_threshold
  4. similarity_method == custom or falconn
  5. start_time_ms == timestamp in millis of the window start time
  6. end_time_ms == timestamp in millis of the window end time
  7. lang == en ..?
  8. data_type == text or image

Image Feature Extraction specific input

  1. urls == a stringified json object with a urls property which is equal to the image_urls field in the social media post object

Redis channels

N.B. prefix channel names with 'genie:'

###Featurize Creation

  1. Text Syntax - feature_txt
  2. Images - feature_img

###Cluster Creation

  1. Text Syntax Clustering - clust_txt
  2. Hashtag Clustering - clust_hash
  3. Image Clustering - clust_img

Testing

  1. job should fail when service-specific attrs are missing and provide error details.
  2. job should fail if internal network calls, file ops, or any I/O fails.

TODO

  1. create system-wide unique redis keys.
  2. decide on a consistent system-testing strategy for all service developers.