Compare Analytics Data Platforms - joshid43016/AnalyticsDataEcosystem GitHub Wiki
There is a plethora of analytics data platforms today, and new ones keep arriving. At the same time, the tools and services that connect to data platforms are also evolving. Deciding what works is difficult and depends on individual situations, as each platform has its own differentiators. The table below is intended to assist with this decision making. In your journey to a modern data ecosystem, some of these points could help with the process and with de-risking if you are making a transition.
Integration with object data stores is a key component and NO longer a differentiator. From an AWS perspective, keeping data on AWS S3 is NOT a sufficient solution for an analytics data platform (i.e. immutable data store, concurrent workload, optimal file size). While AWS Athena is a great tool for interactive queries, it is NOT a data platform solution. I understand that this space is changing with the introduction of AWS Lake Formation, which is still in its infancy.
NI - No information
| Description | Teradata | AWS | Snowflake | Databricks |
|---|---|---|---|---|
| Product | Vantage | Redshift | Snowflake | Delta |
| Cost | Predictable & tunable | Predictable & tunable | Variable & less tunable | Variable & less tunable |
| Performance - Query response time | Predictable and tunable w/ indexes | Predictable and tunable w/ indexes | Increase cluster size | Increase cluster size |
| Workload management | mature TASM | WLM | Isolate workload | NI |
| Limit - number of tables | No Limit | 40K per cluster | No Limit (slowness after 10K) | NI |
| Limit - concurrent session | Managed with TASM | Limited due to single host | New cluster after 8. Isolate workload | NI |
| Cloud Agnostic | Yes | No | Yes | Yes |
| DBA - Backup | Legacy | Snapshot | Modern | NI |
| DBA - Stats | Exists | Exists | Modern | NI |
| DBA - caring & feeding | High | High | Low | NI |
| Decoupled storage & compute architecture | No | No | Yes | Yes |
| Advanced Analytics - Geospatial | Yes | Limited | Yes | NI |
| Advanced Analytics - Graph | No | No | No | Yes |
| Semi-structured Data | Yes | Limited | Yes | NI |
| Security - Password management, RBAC | LDAP, SSO | Limited | Limited | NI |
| Connectivity | Rich. Direct MF, jdbc, odbc, dotNet | jdbc, odbc | jdbc, odbc | NI |
NAS Read & Write ~6 sec
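
As a rough illustration of how the table could feed a decision, the sketch below copies four of its rows into a Python dictionary and shortlists platforms against a set of must-have features. The names `PLATFORMS` and `shortlist` are hypothetical helpers introduced here, not part of any platform's tooling, and treating only "Yes" as a pass (with "Limited" as a fail) is an assumption you may want to relax for your own situation.

```python
# Selected rows copied from the comparison table, one dict per platform.
# "NI" (no information) is kept as-is so it is never mistaken for "Yes" or "No".
PLATFORMS = {
    "Teradata Vantage": {
        "Cloud Agnostic": "Yes",
        "Decoupled storage & compute architecture": "No",
        "Advanced Analytics - Geospatial": "Yes",
        "Semi-structured Data": "Yes",
    },
    "AWS Redshift": {
        "Cloud Agnostic": "No",
        "Decoupled storage & compute architecture": "No",
        "Advanced Analytics - Geospatial": "Limited",
        "Semi-structured Data": "Limited",
    },
    "Snowflake": {
        "Cloud Agnostic": "Yes",
        "Decoupled storage & compute architecture": "Yes",
        "Advanced Analytics - Geospatial": "Yes",
        "Semi-structured Data": "Yes",
    },
    "Databricks Delta": {
        "Cloud Agnostic": "Yes",
        "Decoupled storage & compute architecture": "Yes",
        "Advanced Analytics - Geospatial": "NI",
        "Semi-structured Data": "NI",
    },
}


def shortlist(requirements, platforms=PLATFORMS):
    """Split platforms into (matches, unknown) for the given table rows.

    matches: every required row is "Yes".
    unknown: no required row fails, but at least one is "NI",
             so the platform cannot be ruled in or out from the table alone.
    Assumption: "Limited" and "No" both count as a failed requirement.
    """
    matches, unknown = [], []
    for name, features in platforms.items():
        values = [features.get(row, "NI") for row in requirements]
        if all(value == "Yes" for value in values):
            matches.append(name)
        elif all(value in ("Yes", "NI") for value in values):
            unknown.append(name)
    return matches, unknown


if __name__ == "__main__":
    must_have = ["Cloud Agnostic", "Semi-structured Data"]
    matches, unknown = shortlist(must_have)
    print("Meets all requirements:", matches)
    print("Needs further research (NI):", unknown)
```

Keeping "NI" entries in a separate list is a deliberate choice: a missing data point in the table is a prompt for more research, not evidence that a platform lacks the feature.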