NoSQL - ilya-khadykin/notes-outdated GitHub Wiki

NoSQL vs relational databases

NoSQL dbs	Relational dbs
more flexible for schema changes	SQL was designed to be a query language for relational databases
many NoSQL dbs allow definition of fields on record creation	relational databases are usually table-based, almost like spreadsheets
nested values are common in NoSQL databases	records are stored in rows; columns represent the fields of each row
fields are not standardized between records	SQL queries run within or between tables in a relational database
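
A minimal sketch of the contrast above, using only the Python standard library. The table name, field names, and sample values are made up for illustration; plain dicts stand in for documents and do not represent any particular NoSQL product.

```python
import sqlite3

# Relational side: every row in the table has the same columns.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT, email TEXT)")
conn.execute("INSERT INTO users (name, email) VALUES (?, ?)", ("Ann", "ann@example.com"))
rows = conn.execute("SELECT name, email FROM users").fetchall()

# Document side: fields are defined when a record is created, may differ
# between records, and may contain nested values.
documents = [
    {"name": "Ann", "email": "ann@example.com"},
    {"name": "Bob", "address": {"city": "Springfield"}, "tags": ["admin"]},
]

# Because fields are not standardized, readers must tolerate missing ones.
cities = [doc.get("address", {}).get("city") for doc in documents]
```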

document stores:

key-value stores:

you have a key you can query by, and the value stored at that key (you usually can't query by anything other than that key)
some key-value stores let you define more than one key;
sometimes used alongside relational databases for caching
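
The caching use case can be sketched roughly as follows. This is an illustration only: a plain dict stands in for the key-value store, sqlite3 stands in for the relational database, and the `products` table and `get_product_name` helper are hypothetical names.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE products (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO products (id, name) VALUES (1, 'keyboard')")

cache = {}    # stand-in for a key-value store: lookups by key only
db_reads = 0  # counts how often the relational database is actually hit

def get_product_name(product_id):
    """Return the product name, consulting the cache before the database."""
    global db_reads
    key = f"product:{product_id}"
    if key in cache:
        return cache[key]
    db_reads += 1
    row = conn.execute(
        "SELECT name FROM products WHERE id = ?", (product_id,)
    ).fetchone()
    name = row[0] if row else None
    if name is not None:
        cache[key] = name  # store the result for later lookups
    return name

first = get_product_name(1)   # goes to the database
second = get_product_name(1)  # served from the cache
```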

BigTable/tabular:

graph databases:

designed for data best represented as interconnected nodes (for example, a series of road intersections)
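
To make the road-intersection example concrete, here is a small sketch in plain Python. The intersection names and the `shortest_route` helper are invented for illustration; a real graph database would expose its own query language instead.

```python
from collections import deque

# Each intersection (node) maps to the intersections it connects to (edges).
roads = {
    "A": ["B", "C"],
    "B": ["A", "D"],
    "C": ["A", "D"],
    "D": ["B", "C", "E"],
    "E": ["D"],
}

def shortest_route(graph, start, goal):
    """Breadth-first search: returns a route with the fewest hops, or None."""
    queue = deque([[start]])
    visited = {start}
    while queue:
        path = queue.popleft()
        node = path[-1]
        if node == goal:
            return path
        for neighbor in graph[node]:
            if neighbor not in visited:
                visited.add(neighbor)
                queue.append(path + [neighbor])
    return None

route = shortest_route(roads, "A", "E")
```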

object databases:

CouchDB: document db written in Erlang

MongoDB: document db which uses JavaScript

Notes:

querying is not done over HTTP (in comparison with CouchDB)
native drivers for each language
does not support CouchDB-style views
master/slave replication only: only the master accepts writes
consistent, partition-tolerant db
- all users always get the same data back from MongoDB
- documents are partitioned using sharding
- each partition holds a subset of the records
- shards are created based on a key you choose (this lets you customize how MongoDB partitions the db)
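
The idea of partitioning records by a chosen key can be sketched as below. This is a generic hash-partitioning illustration, not a description of MongoDB's internals; `NUM_SHARDS`, `shard_for`, and the `user_id` key are assumptions made for the example.

```python
import hashlib

NUM_SHARDS = 3  # hypothetical cluster size

def shard_for(shard_key_value, num_shards=NUM_SHARDS):
    """Map a shard-key value to a shard number using a stable hash."""
    digest = hashlib.sha256(str(shard_key_value).encode("utf-8")).hexdigest()
    return int(digest, 16) % num_shards

documents = [{"user_id": i, "name": f"user{i}"} for i in range(10)]

# Each shard ends up holding a subset of the records.
shards = {n: [] for n in range(NUM_SHARDS)}
for doc in documents:
    shards[shard_for(doc["user_id"])].append(doc)
```

Choosing a different key (for example `name` instead of `user_id`) changes which records land together, which is the customization the note refers to.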

Cassandra: originally developed by Facebook

Notes:

querying not over HTTP
native driver for each language
cross between key/value store and tabular database
available, partition-tolerant db:
- you should always be able to read from and write to Cassandra
- hardware nodes can be added with no downtime
- consistency can be adjusted, although stricter consistency reduces availability
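
The trade-off in the last point can be illustrated with the usual quorum arithmetic: with N replicas, a read that contacts R of them is guaranteed to overlap a write acknowledged by W of them when R + W > N. The helper names below are hypothetical and this is only a sketch of the arithmetic, not a driver API.

```python
def quorum(replication_factor):
    """Smallest majority of replicas."""
    return replication_factor // 2 + 1

def reads_see_latest_write(read_replicas, write_replicas, replication_factor):
    """True when every read set overlaps every write set (R + W > N)."""
    return read_replicas + write_replicas > replication_factor

def tolerated_failures(required_replicas, replication_factor):
    """How many replicas may be down while the operation can still succeed."""
    return replication_factor - required_replicas

N = 3
q = quorum(N)

# Requiring more replicas per operation buys consistency...
strict = reads_see_latest_write(q, q, N)
relaxed = reads_see_latest_write(1, 1, N)

# ...but leaves fewer replicas that are allowed to be unavailable.
failures_at_quorum = tolerated_failures(q, N)
failures_at_one = tolerated_failures(1, N)
failures_at_all = tolerated_failures(N, N)
```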

Riak: distributed key/value db written in Erlang

Notes:

MapReduce functions can be written in Erlang as well as JavaScript
designed primarily to work on Mac and Linux
available, partition-tolerant db:
- you should always be able to read from and write to Riak
- hardware nodes can be added easily
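
For readers unfamiliar with MapReduce, mentioned in the notes above, here is the general pattern sketched in Python. Riak itself takes such functions in Erlang or JavaScript, so this is not Riak's API; the records and the `map_phase` / `reduce_phase` names are made up for illustration.

```python
from collections import defaultdict

# Hypothetical stored records: key -> text value.
records = {"doc1": "red green red", "doc2": "green blue"}

def map_phase(key, value):
    """Emit a (word, 1) pair for every word in one record."""
    return [(word, 1) for word in value.split()]

def reduce_phase(word, counts):
    """Combine all counts emitted for one word."""
    return word, sum(counts)

# Group the mapped pairs by word, then reduce each group.
grouped = defaultdict(list)
for key, value in records.items():
    for word, count in map_phase(key, value):
        grouped[word].append(count)

totals = dict(reduce_phase(word, counts) for word, counts in grouped.items())
```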

Redis: key/value store

Notes:

querying not over HTTP
native drivers for each language
designed primarily to work on Mac and Linux (does not have Windows support)
master/slave replication
consistent, partition-tolerant db:
- each user should always get the same data back from Redis
- writing directly to a slave is possible, but violates consistency
- data replicated to multiple slaves

queries primarily by key
specific values from hashes within records can be retrieved
a value does not have to be a string, unlike in many key/value stores
values can also be lists, sets, and hashes of strings:
- lists are ordered lists of strings
- hashes are further key/value pairs
- sets are collections of non-repeating values
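
The value types listed above can be modeled with plain Python structures. This is only an in-memory illustration with invented key names; the Redis commands named in the comments (SET, RPUSH, SADD, HSET, HGET) are the rough counterparts, and no Redis client is used.

```python
store = {}  # stand-in for the key space; every lookup is by key

# String value (roughly SET page:title "NoSQL notes")
store["page:title"] = "NoSQL notes"

# List of strings: ordered, duplicates allowed (roughly RPUSH)
store["recent:visitors"] = []
for visitor in ["ann", "bob", "ann"]:
    store["recent:visitors"].append(visitor)

# Set of strings: non-repeating values (roughly SADD)
store["unique:visitors"] = set()
for visitor in store["recent:visitors"]:
    store["unique:visitors"].add(visitor)

# Hash: further key/value pairs inside one record (roughly HSET)
store["user:1"] = {"name": "Ann", "city": "Springfield"}

# A specific value from a hash can be retrieved (roughly HGET user:1 city)
city = store["user:1"]["city"]
```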