Big Data and Analytics Services Flashcards

AWS Certified Data Engineer Associate DEA-C01 Flashcards

| Front | Back |
| --- | --- |
| Athena | Athena is a serverless query service to analyze data directly in Amazon S3 using SQL. |
| Athena Configuration | Requires creating and connecting to a database/table with data stored in S3. |
| Athena Pricing | Priced per query based on the volume of data scanned in bytes. |
| Athena Use Case | Used for ad-hoc analysis of structured data stored in S3 without the need for ETL processes. |
| EMR | Amazon EMR is used for big data processing and analysis using Apache Spark, Hadoop, and other frameworks. |
| EMR Configuration | Requires cluster setup with instances, instance groups, applications, and job flow steps. |
| EMR Pricing | Priced based on the number and type of EC2 instances in use by the cluster. |
| EMR Use Case | Ideal for running distributed frameworks like Spark and Hadoop for processing large datasets. |
| Kinesis | Amazon Kinesis processes real-time data streams for analytics and applications. |
| Kinesis Configuration | Requires setting up streams, shards, producers, and consumers. |
| Kinesis Pricing | Charged based on the number of shards and data throughput/processing. |
| Kinesis Use Case | Best for real-time video analytics, log processing, IoT data streams, and metric monitoring. |
| QuickSight | Amazon QuickSight is a business intelligence service to visualize data and create dashboards. |
| QuickSight Configuration | Requires data source connections, importing datasets, and defining visualizations. |
| QuickSight Pricing | Charged per user and type of access: Standard or Enterprise Edition. |
| QuickSight Use Case | Perfect for creating interactive visualizations and sharing business reports. |
This deck emphasizes services such as EMR, Athena, Kinesis, and QuickSight, including their use cases and configurations for big data processing and analytics.
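
The four Pricing cards each name a different billing dimension, and a small sketch can make the contrast easier to remember. The Python below is only an illustration: the function names, the decimal-terabyte constant, and every rate passed in are hypothetical placeholders, not actual AWS prices, and the Kinesis model is deliberately simplified to a per-shard-hour charge plus a per-GB charge.

```python
# Sketch of the billing dimensions named on the Pricing cards.
# All rates are caller-supplied, hypothetical values, not real AWS prices.

TB = 10**12  # assumption: decimal terabyte


def athena_query_cost(bytes_scanned: int, usd_per_tb: float) -> float:
    """Athena: priced per query by the volume of data scanned."""
    return bytes_scanned / TB * usd_per_tb


def emr_cluster_cost(instance_count: int, hours: float,
                     usd_per_instance_hour: float) -> float:
    """EMR: priced by the number (and type, via the rate) of EC2 instances."""
    return instance_count * hours * usd_per_instance_hour


def kinesis_stream_cost(shards: int, hours: float, usd_per_shard_hour: float,
                        gb_ingested: float, usd_per_gb: float) -> float:
    """Kinesis: charged by shard count plus data throughput (simplified)."""
    return shards * hours * usd_per_shard_hour + gb_ingested * usd_per_gb


def quicksight_cost(users: int, usd_per_user_month: float) -> float:
    """QuickSight: charged per user; the rate depends on the edition."""
    return users * usd_per_user_month


if __name__ == "__main__":
    # Hypothetical rates chosen only to show how each function is called.
    print(athena_query_cost(2 * TB, usd_per_tb=5.0))
    print(emr_cluster_cost(3, hours=10, usd_per_instance_hour=0.5))
    print(kinesis_stream_cost(2, 24, 0.25, gb_ingested=100, usd_per_gb=0.5))
    print(quicksight_cost(4, usd_per_user_month=2.5))
```

The point of the sketch is the shape of each formula: Athena scales with data scanned, EMR with instance-hours, Kinesis with shards and throughput, and QuickSight with user count.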