Category: Data Operations and Support
A data engineer needs to build an extract, transform, and load (ETL) job. The ETL job will process daily incoming .CSV files that users upload to an A...
Category: Data Operations and Support
A data engineer needs to create an AWS Lambda function that converts the format of data from .CSV to Apache Parquet. The Lambda function must run only...
Category: Data Ingestion and Transformation
A data engineer needs to use AWS Step Functions to design an orchestration workflow. The workflow must parallel process a large collection of data fil...
Category: Data Ingestion and Transformation
A company is migrating a legacy application to an Amazon S3 based data lake. A data engineer reviewed data that is associated with the legacy applicat...
Category: Data Operations and Support
A company uses an on-premises Microsoft SQL Server database to store financial transaction data. The company migrates the transaction data from the on...
Category: Data Store Management
A data engineer must manage the ingestion of real-time streaming data into AWS. The data engineer wants to perform real-time analytics on the incoming...
Category: Data Operations and Support
A company receives a daily file that contains customer data in .xls format. The company stores the file in Amazon S3. The daily file is approximately ...
Category: Data Store Management
A data engineer maintains custom Python scripts that perform a data formatting process that many AWS Lambda functions use. When the data engineer need...
Category: Data Ingestion and Transformation
A data engineer must build an extract, transform, and load (ETL) pipeline to process and load data from 10 source systems into 10 tables that are in a...
Category: Data Operations and Support
A data engineer needs to securely transfer 5 TB of data from an on-premises data center to an Amazon S3 bucket. Approximately 5% of the data changes e...
Category: Data Store Management
A company is developing an application that runs on Amazon EC2 instances. Currently, the data that the application generates is temporary. However, th...
Category: Data Store Management
A company needs to partition the Amazon S3 storage that the company uses for a data lake. The partitioning will use a path of the S3 object keys in th...
Category: Data Operations and Support
A data engineer must orchestrate a data pipeline that consists of one AWS Lambda function and one AWS Glue job. The solution must integrate with AWS s...
Category: Data Operations and Support
A data engineer uses Amazon Redshift to run resource-intensive analytics processes once every month. Every month, the data engineer creates a new Reds...
Category: Data Operations and Support
A company maintains an Amazon Redshift provisioned cluster that the company uses for extract, transform, and load (ETL) operations to support critical...
Category: Data Security and Governance
A company stores data in a data lake that is in Amazon S3. Some data that the company stores in the data lake contains personally identifiable informa...
Category: Data Operations and Support
A company uses Amazon Redshift for its data warehouse. The company must automate refresh schedules for Amazon Redshift materialized views. Which solut...
Category: Data Operations and Support
A company needs to set up a data catalog and metadata management for data sources that run in the AWS Cloud. The company will use the data catalog to ...
Category: Data Operations and Support
A company extracts approximately 1 TB of data every day from data sources such as SAP HANA, Microsoft SQL Server, MongoDB, Apache Kafka, and Amazon Dy...
Category: Data Store Management
A company stores petabytes of data in thousands of Amazon S3 buckets in the S3 Standard storage class. The data supports analytics workloads that have...