🔥 40% Off Crucial Exams Memberships — Deal ends today!

3 hours, 34 minutes remaining!
Bash, the Crucial Exams Chat Bot
AI Bot

Data Storage and Databases (GCP PDE)  Flashcards

GCP Professional Data Engineer Flashcards

Bigtable indexing
Bigtable consistency model
Cloud Storage signed URLs
Provides temporary access to specific objects using a time-limited URL
Enables querying external data sources like Google Sheets or Cloud Storage files
BigQuery federated queries
BigQuery pricing model
Uses row keys for primary indexing with no built-in secondary indexes
Provides predictable costs by purchasing a dedicated amount of query processing capacity
Charges are based on storage and the amount of data processed during queries
BigQuery flat-rate pricing
Provides only eventual consistency for data writes and reads
FrontBack
Archive Storage ideal use caseBest for long-term storage of data rarely accessed
BigQuery clusteringOrganizes data using a field to optimize performance for specific query patterns
BigQuery data sharingUses datasets and authorized views to share data securely across projects
BigQuery export formatsSupports exporting data in formats like CSV, JSON, and Avro to Cloud Storage
BigQuery federated queriesEnables querying external data sources like Google Sheets or Cloud Storage files
BigQuery flat-rate pricingProvides predictable costs by purchasing a dedicated amount of query processing capacity
BigQuery integration with Looker StudioEnables interactive visualization and reporting for datasets
BigQuery optimizationUse partitioned or clustered tables for improved query performance
BigQuery partitioningHelps optimize query costs by organizing data based on a specific column like date
BigQuery pricing modelCharges are based on storage and the amount of data processed during queries
BigQuery reserved slotsOffers guaranteed computing resources to improve query performance
BigQuery scalabilitySupports petabytes of data with serverless querying
BigQuery slot-based pricingDetermines query performance based on the number of slots purchased
BigQuery use caseBest for analyzing large-scale datasets using SQL-like queries
Bigtable backup optionsSupports creating backups for tables in specific instances for point-in-time recovery
Bigtable consistency modelProvides only eventual consistency for data writes and reads
Bigtable data modelWide-column database optimized for sparse data
Bigtable indexingUses row keys for primary indexing with no built-in secondary indexes
Bigtable localityStores data physically adjacent based on row keys for faster access
Bigtable query limitationsRequires row-key optimization as it does not natively support complex joins or aggregations
Bigtable region distributionCan be deployed across multiple zones for disaster tolerance
Bigtable replicationUsed for availability and disaster recovery purposes
Bigtable row designDesign row keys to optimize data access patterns
Bigtable scalingAutomatically adjusts to handle increased throughput or storage without downtime
Bigtable use caseBest for low-latency operations on large-scale time-series data
Cloud Datastore vs BigtableDatastore is better for transactional consistency, while Bigtable is better for analytics and high throughput
Cloud Firestore use caseBest for mobile and web applications requiring offline support and real-time synchronization
Cloud Firestore vs DatastoreFirestore provides advanced querying and offline support, while Datastore offers simpler APIs
Cloud Spanner use caseBest for horizontally scalable relational databases with strong consistency requirements
Cloud SQL backupsSupports automated and on-demand backups for disaster recovery
Cloud SQL replication typesSupports both asynchronous and synchronous replication for high-availability scenarios
Cloud SQL use caseBest for relational databases requiring compatibility with MySQL, PostgreSQL, or SQL Server
Cloud Storage data lifecycle managementPolicies automatically delete or transition objects between tiers based on age
Cloud Storage IAMUsed for granular control over who can access and manage data
Cloud Storage scalabilityAutomatically scales to handle large amounts of unstructured data
Cloud Storage signed URLsProvides temporary access to specific objects using a time-limited URL
Cloud Storage tiersStandard, Nearline, Coldline, and Archive
Cloud Storage use caseBest for storing unstructured data like images, videos, and backups
Cloud Storage vs Persistent DiskCloud Storage is object storage, while Persistent Disk is block storage attached to VMs
Coldline Storage ideal use caseBest for data accessed less than once a year
Datastore access controlUses IAM policies to define permissions at project, entity group, or key levels
Datastore indexingAutomatically indexes properties for queries but allows custom index configuration
Datastore queriesSupport strong or eventual consistency depending on query type
Datastore relationship modelingSupports nested entities and ancestor query patterns
Datastore transactionsAllow atomicity for multiple operations across multiple entities
Datastore use caseBest for scalable NoSQL applications and transactional workloads
Nearline Storage ideal use caseBest for data accessed less than once a month but more than once a year
Streaming data to BigQuery use caseBest for real-time analytics and dashboards
Front
Cloud Storage data lifecycle management
Click the card to flip
Back
Policies automatically delete or transition objects between tiers based on age
Front
Archive Storage ideal use case
Back
Best for long-term storage of data rarely accessed
Front
Bigtable backup options
Back
Supports creating backups for tables in specific instances for point-in-time recovery
Front
Cloud SQL replication types
Back
Supports both asynchronous and synchronous replication for high-availability scenarios
Front
Cloud Firestore use case
Back
Best for mobile and web applications requiring offline support and real-time synchronization
Front
Cloud Storage scalability
Back
Automatically scales to handle large amounts of unstructured data
Front
Cloud Spanner use case
Back
Best for horizontally scalable relational databases with strong consistency requirements
Front
Cloud Storage signed URLs
Back
Provides temporary access to specific objects using a time-limited URL
Front
Bigtable scaling
Back
Automatically adjusts to handle increased throughput or storage without downtime
Front
BigQuery partitioning
Back
Helps optimize query costs by organizing data based on a specific column like date
Front
Datastore use case
Back
Best for scalable NoSQL applications and transactional workloads
Front
Cloud Firestore vs Datastore
Back
Firestore provides advanced querying and offline support, while Datastore offers simpler APIs
Front
BigQuery slot-based pricing
Back
Determines query performance based on the number of slots purchased
Front
Bigtable indexing
Back
Uses row keys for primary indexing with no built-in secondary indexes
Front
Datastore transactions
Back
Allow atomicity for multiple operations across multiple entities
Front
BigQuery reserved slots
Back
Offers guaranteed computing resources to improve query performance
Front
Bigtable row design
Back
Design row keys to optimize data access patterns
Front
BigQuery export formats
Back
Supports exporting data in formats like CSV, JSON, and Avro to Cloud Storage
Front
Bigtable query limitations
Back
Requires row-key optimization as it does not natively support complex joins or aggregations
Front
Cloud Storage vs Persistent Disk
Back
Cloud Storage is object storage, while Persistent Disk is block storage attached to VMs
Front
Datastore queries
Back
Support strong or eventual consistency depending on query type
Front
Bigtable consistency model
Back
Provides only eventual consistency for data writes and reads
Front
Cloud SQL backups
Back
Supports automated and on-demand backups for disaster recovery
Front
Cloud Storage use case
Back
Best for storing unstructured data like images, videos, and backups
Front
BigQuery flat-rate pricing
Back
Provides predictable costs by purchasing a dedicated amount of query processing capacity
Front
Coldline Storage ideal use case
Back
Best for data accessed less than once a year
Front
Bigtable replication
Back
Used for availability and disaster recovery purposes
Front
BigQuery pricing model
Back
Charges are based on storage and the amount of data processed during queries
Front
BigQuery clustering
Back
Organizes data using a field to optimize performance for specific query patterns
Front
BigQuery use case
Back
Best for analyzing large-scale datasets using SQL-like queries
Front
Streaming data to BigQuery use case
Back
Best for real-time analytics and dashboards
Front
Datastore access control
Back
Uses IAM policies to define permissions at project, entity group, or key levels
Front
Cloud Storage tiers
Back
Standard, Nearline, Coldline, and Archive
Front
Bigtable data model
Back
Wide-column database optimized for sparse data
Front
BigQuery optimization
Back
Use partitioned or clustered tables for improved query performance
Front
Datastore relationship modeling
Back
Supports nested entities and ancestor query patterns
Front
Bigtable use case
Back
Best for low-latency operations on large-scale time-series data
Front
BigQuery integration with Looker Studio
Back
Enables interactive visualization and reporting for datasets
Front
BigQuery data sharing
Back
Uses datasets and authorized views to share data securely across projects
Front
Bigtable region distribution
Back
Can be deployed across multiple zones for disaster tolerance
Front
Cloud Datastore vs Bigtable
Back
Datastore is better for transactional consistency, while Bigtable is better for analytics and high throughput
Front
BigQuery scalability
Back
Supports petabytes of data with serverless querying
Front
Datastore indexing
Back
Automatically indexes properties for queries but allows custom index configuration
Front
Cloud SQL use case
Back
Best for relational databases requiring compatibility with MySQL, PostgreSQL, or SQL Server
Front
BigQuery federated queries
Back
Enables querying external data sources like Google Sheets or Cloud Storage files
Front
Bigtable locality
Back
Stores data physically adjacent based on row keys for faster access
Front
Nearline Storage ideal use case
Back
Best for data accessed less than once a month but more than once a year
Front
Cloud Storage IAM
Back
Used for granular control over who can access and manage data
1/48
This deck focuses on GCP storage solutions, including Cloud Storage, Bigtable, Datastore, BigQuery, and how to choose the right storage option for a given use case.
Share on...
Follow us on...