Compare Analytics Data Platforms - joshid43016/AnalyticsDataEcosystem GitHub Wiki

There is a plethora of analytics data platforms today, and new ones keep arriving. At the same time, the tools and services that connect to these platforms are also evolving. Deciding what works is difficult and depends on the individual situation, as each platform has its own differentiators. The table below is meant to assist with that decision. On your journey to a modern data ecosystem, some of these comparisons could help in the process and in de-risking a transition.

Integration with object data stores is a key component and NO longer a differentiator. From an AWS perspective, keeping data on AWS S3 is NOT, by itself, a sufficient solution for an analytics data platform (consider the immutable data store, concurrent workloads, and optimal file size). While AWS Athena is a great tool for interactive queries, it is NOT a data platform solution. I understand that this space is changing with the introduction of AWS Lake Formation, which is still in its infancy.

NI - No information

| Description | Teradata Vantage | AWS Redshift | Snowflake | Databricks Delta |
| --- | --- | --- | --- | --- |
| Cost | Predictable & tunable | Predictable & tunable | Variable & less tunable | Variable & less tunable |
| Performance - Query response time | Predictable and tunable w/ indexes | Predictable and tunable w/ indexes | Increase cluster size | Increase cluster size |
| Workload management | Mature (TASM) | WLM | Isolate workload | NI |
| Limit - Number of tables | No limit | 40K per cluster | No limit (slowness after 10K) | NI |
| Limit - Concurrent sessions | Managed with TASM | Limited due to single host | New cluster after 8. Isolate workload | NI |
| Cloud agnostic | Yes | No | Yes | Yes |
| DBA - Backup | Legacy | Snapshot | Modern | NI |
| DBA - Stats | Exists | Exists | Modern | NI |
| DBA - Caring & feeding | High | High | Low | NI |
| Decoupled storage & compute architecture | No | No | Yes | Yes |
| Advanced Analytics - Geospatial | Yes | Limited | Yes | NI |
| Advanced Analytics - Graph | No | No | No | Yes |
| Semi-structured data | Yes | Limited | Yes | NI |
| Security - Password management, RBAC | LDAP, SSO | Limited | Limited | NI |
| Connectivity | Rich. Direct MF, JDBC, ODBC, dotNet | JDBC, ODBC | JDBC, ODBC | NI |
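
As a rough illustration of how the table above might be used for shortlisting, the sketch below encodes only its Yes/No rows and filters the platforms by a set of required capabilities. The names `PLATFORMS`, `TABLE`, and `shortlist` are hypothetical helpers introduced here for illustration; the values are copied from the table and carry the same caveats.

```python
# Platform columns, in the same order as the comparison table.
PLATFORMS = ["Teradata Vantage", "AWS Redshift", "Snowflake", "Databricks Delta"]

# Only the strictly Yes/No rows of the table are encoded (True = Yes, False = No).
# Rows with "Limited" or "NI" entries are left out of this sketch on purpose.
TABLE = {
    "Cloud agnostic": [True, False, True, True],
    "Decoupled storage & compute": [False, False, True, True],
    "Advanced analytics - graph": [False, False, False, True],
}


def shortlist(required):
    """Return the platforms marked Yes for every criterion in `required`."""
    keep = []
    for i, name in enumerate(PLATFORMS):
        if all(TABLE[criterion][i] for criterion in required):
            keep.append(name)
    return keep


if __name__ == "__main__":
    # Platforms that are both cloud agnostic and decouple storage from compute.
    print(shortlist(["Cloud agnostic", "Decoupled storage & compute"]))
```

A filter like this only narrows the field on hard requirements; rows such as cost or DBA effort still need a judgment call for each individual situation.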

NAS Read & Write ~6 sec