Bash, the Crucial Exams Chat Bot
AI Bot
Big Data and Analytics Services Flashcards
Front | Back |
Athena | Athena is a serverless query service to analyze data directly in Amazon S3 using SQL. |
Athena Configuration | Requires creating and connecting to a database/table with data stored in S3. |
Athena Pricing | Priced per query based on the volume of data scanned in bytes. |
Athena Use Case | Used for ad-hoc analysis of structured data stored in S3 without the need for ETL processes. |
EMR | Amazon EMR is used for big data processing and analysis using Apache Spark, Hadoop, and other frameworks. |
EMR Configuration | Requires cluster setup with instances, instance groups, applications, and job flow steps. |
EMR Pricing | Priced based on the number and type of EC2 instances in use by the cluster. |
EMR Use Case | Ideal for running distributed frameworks like Spark and Hadoop for processing large datasets. |
Kinesis | Amazon Kinesis processes real-time data streams for analytics and applications. |
Kinesis Configuration | Requires setting up streams, shards, producers, and consumers. |
Kinesis Pricing | Charged based on the number of shards and data throughput/processing. |
Kinesis Use Case | Best for real-time video analytics, log processing, IoT data streams, and metric monitoring. |
QuickSight | Amazon QuickSight is a business intelligence service to visualize data and create dashboards. |
QuickSight Configuration | Requires data source connections, importing datasets, and defining visualizations. |
QuickSight Pricing | Charged per user and type of access: Standard or Enterprise Edition. |
QuickSight Use Case | Perfect for creating interactive visualizations and sharing business reports. |
Front
EMR Use Case
Click the card to flip
Back
Ideal for running distributed frameworks like Spark and Hadoop for processing large datasets.
Front
EMR Configuration
Back
Requires cluster setup with instances, instance groups, applications, and job flow steps.
Front
QuickSight Use Case
Back
Perfect for creating interactive visualizations and sharing business reports.
Front
Athena Configuration
Back
Requires creating and connecting to a database/table with data stored in S3.
Front
QuickSight Pricing
Back
Charged per user and type of access: Standard or Enterprise Edition.
Front
EMR
Back
Amazon EMR is used for big data processing and analysis using Apache Spark, Hadoop, and other frameworks.
Front
Kinesis
Back
Amazon Kinesis processes real-time data streams for analytics and applications.
Front
Athena
Back
Athena is a serverless query service to analyze data directly in Amazon S3 using SQL.
Front
EMR Pricing
Back
Priced based on the number and type of EC2 instances in use by the cluster.
Front
Kinesis Use Case
Back
Best for real-time video analytics, log processing, IoT data streams, and metric monitoring.
Front
Athena Pricing
Back
Priced per query based on the volume of data scanned in bytes.
Front
QuickSight Configuration
Back
Requires data source connections, importing datasets, and defining visualizations.
Front
QuickSight
Back
Amazon QuickSight is a business intelligence service to visualize data and create dashboards.
Front
Kinesis Pricing
Back
Charged based on the number of shards and data throughput/processing.
Front
Kinesis Configuration
Back
Requires setting up streams, shards, producers, and consumers.
Front
Athena Use Case
Back
Used for ad-hoc analysis of structured data stored in S3 without the need for ETL processes.
1/16
This deck emphasizes services such as EMR, Athena, Kinesis, and QuickSight, including their use cases and configurations for big data processing and analytics.