Labour Day Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 70percent

Amazon Web Services BDS-C00 AWS Certified Big Data -Speciality Exam Practice Test

Demo: 39 questions
Total 264 questions

AWS Certified Big Data -Speciality Questions and Answers

Question 1

To help you manage your Amazon EC2 instances, images, and other Amazon EC2 resources, you can assign your own metadata to each resource in the form of____________

Options:

A.

special filters

B.

functions

C.

tags

D.

wildcards

Question 2

A user has created a launch configuration for Auto Scaling where CloudWatch detailed monitoring is disabled. The user wants to now enable detailed monitoring. How can the user achieve this?

Options:

A.

Update the Launch config with CLI to set InstanceMonitoringDisabled = false

B.

The user should change the Auto Scaling group from the AWS console to enable detailed monitoring

C.

Update the Launch config with CLI to set InstanceMonitoring.Enabled = true

D.

Create a new Launch Config with detail monitoring enabled and update the Auto Scaling group

Question 3

A company with a support organization needs support engineers to be able to search historic cases to provide fast responses on new issues raised. The company has forwarded all support messages into an Amazon Kinesis Stream. This meets a company objective of using only managed services to reduce.

The company needs an appropriate architecture that allows support engineers to search on historic cases can find similar issues and their associated responses.

Which AWS Lambda action is most appropriate?

Options:

A.

Ingest and index the content into an Amazon Elasticsearch domain

B.

Stem and tokenize the input and store the results into Amazon ElastiCache

C.

Write data as JSON into Amazon DynamoDB with primary and secondary indexes

D.

Aggregate feedback is Amazon S3 using a columnar format with partitioning

Question 4

A user is running a webserver on EC2. The user wants to receive the SMS when the EC2 instance utilization is above the threshold limit. Which AWS services should the user configure in this case?

Options:

A.

AWS CloudWatch + AWS SES

B.

AWS CloudWatch + AWS SNS

C.

AWS CloudWatch + AWS SQS

D.

AWS EC2 + AWS CloudWatch

Question 5

You are currently hosting multiple applications in a VPC and have logged numerous port scans coming in from a specific IP address block. Your security team has requested that all access from the offending IP address block be denied for the next 24 hours.

Which of the following is the best method to quickly and temporarily deny access from the specified IP address block?

Options:

A.

Create an AD policy to modify Windows Firewall settings on all hosts in the VPC to deny access from the IP address block

B.

Modify the Network ACLs associated with all public subnets in the VPC to deny access from the IP address block

C.

Add a rule to all of the VPC 5 Security Groups to deny access from the IP address block D. Modify the Windows Firewall settings on all Amazon Machine Images (AMIs) that your organization uses in that VPC to deny access from the IP address block

Question 6

Will I be charged if the DB instance is idle?

Options:

A.

No

B.

Yes

C.

Only is running in GovCloud

D.

Only if running in VPC

Question 7

In the Amazon RDS Oracle DB engine, the Database Diagnostic Pack and the Database Tuning Pack are only available with ______________

Options:

A.

Oracle Standard Edition

B.

Oracle Express Edition

C.

Oracle Enterprise Edition

D.

None of these

Question 8

You run a web application with the following components Elastic Load Balancer (ELB), 3 Web/Application servers, 1 MySQL RDS database with read replicas, and Amazon Simple Storage Service (Amazon S3) for static content. Average response time for users is increasing slowly. What three CloudWatch RDS metrics will allow you to identify if the database is the bottleneck? Choose 3 answers

Options:

A.

The number of outstanding IOs waiting to access the disk

B.

The amount of write latency

C.

The amount of disk space occupied by binary logs on the master.

D.

The amount of time a Read Replica DB Instance lags behind the source DB Instance

E.

The average number of disk I/O operations per second.

Question 9

An organization has configured a VPC with an Internet Gateway (IGW). Pairs of public and private subnets (each with one subnet per Availability Zone), and an Elastic Load Balancer (ELB) configured to use the public subnets. The application’s web tier leverages the ELB. Auto Scaling and a multi-AZ RDS database instance the organization would like to eliminate any potential single points of failure in this design.

What step should you take to achieve this organization's objective?

Options:

A.

Nothing, there are no single points of failure in this architecture.

B.

Create and attach a second IGW to provide redundant internet connectivity.

C.

Create and configure a second Elastic Load Balancer to provide a redundant load balancer.

D.

Create a second multi-AZ RDS instance in another Availability Zone and configure replication to provide a redundant database.

Question 10

If I modify a DB Instance or the DB parameter group associated with the instance, should I reboot the instance for the changes to take effect?

Options:

A.

No

B.

Yes

Question 11

After an Amazon VPC instance is launched, can I change the VPC security groups it belongs to?

Options:

A.

No. You cannot.

B.

Yes. You can.

C.

Only if you are the root user

D.

Only if the tag "VPC_Change_Group" is true

Question 12

A customer wants to track access to their Amazon Simple Storage Service (S3) buckets and also use this information for their internal security and access audits. Which of the following will meet the Customer requirement?

Options:

A.

Enable AWS CloudTrail to audit all Amazon S3 bucket access.

B.

Enable server access logging for all required Amazon S3 buckets. C. Enable the Requester Pays option to track access via AWS Billing D. Enable Amazon S3 event notifications for Put and Post.

Question 13

A user is planning to use the AWS RDS with MySQL. Which of the below mentioned services the user is not going to pay?

Options:

A.

Data transfer

B.

RDS CloudWatch metrics

C.

Data storage

D.

I/O requests per month

Question 14

You have been asked to handle a large data migration from multiple Amazon RDS MySQL instances to a DynamoDB table. You have been given a short amount of time to complete the data migration. What will allow you to complete this complex data processing workflow?

Options:

A.

Create an Amazon Kinesis data stream, pipe in all of the Amazon RDS data, and direct data toward DynamoDB table

B.

Write a script in you language of choice, install the script on an Amazon EC2 instance, and then use Auto Scaling groups to ensure that the latency of the mitigation pipelines never exceeds four seconds in any 15-minutes period.

C.

Write a bash script to run on your Amazon RDS instance that will export data into DynamoDB

D.

Create a data pipeline to export Amazon RDS data and import the data into DynamoDB

Question 15

Without _____, you must either create multiple AWS accounts-each with its own billing and subscriptions to AWS products-or your employees must share the security credentials of a single AWS account.

Options:

A.

Amazon RDS

B.

Amazon Glacier

C.

Amazon EMR

D.

Amazon IAM

Question 16

You have an application running on an Amazon Elastic Compute Cloud instance, that uploads 5 GB video objects to Amazon Simple Storage Service (S3). Video uploads are taking longer than expected, resulting in poor application performance. Which method will help improve performance of your application?

Options:

A.

Enable enhanced networking

B.

Use Amazon S3 multipart upload

C.

Leveraging Amazon CloudFront, use the HTTP POST method to reduce latency.

D.

Use Amazon Elastic Block Store Provisioned IOPs and use an Amazon EBS-optimized instance

Question 17

What's an ECU?

Options:

A.

Extended Cluster User.

B.

None of these.

C.

Elastic Computer Usage.

D.

Elastic Compute Unit.

Question 18

A customer is collecting clickstream data using Amazon kinesis and is grouping the events by IP address into 5-minute chunks stored in Amazon S3.

Many analysts in the company use Hive on Amazon EMR to analyze this data. Their queries always reference a single IP address. Data must be optimized for querying based on UP address using Hive running on Amazon EMR. What is the most efficient method to query the data with Hive?

Options:

A.

Store an index of the files by IP address in the Amazon DynamoDB metadata store for EMRFS

B.

Store the Amazon S3 objects with the following naming scheme:

bucketname/source=ip_address/year=yy/month=mm/day=dd/hour=hh/filename

C.

Store the data in an HBase table with the IP address as the row key

D.

Store the events for an IP address as a single file in Amazon S3 and add metadata with key:Hive_Partitioned_IPAddress

Question 19

An organization would like to run analytics on their Elastic Load Balancing logs stored in Amazon S3 and join this data with other tables in Amazon S3. The users are currently using a BI tool connecting with JDBC and would like to keep using this BI tool.

Which solution would result in the LEAST operational overhead?

Options:

A.

Trigger a Lambda function when a new log file is added to the bucket to transform and load it into Amazon Redshift. Run the VACUUM command on the Amazon Redshift cluster every night.

B.

Launch a long-running Amazon EMR cluster that continuously downloads and transforms new files from Amazon S3 into its HDFS storage. Use Presto to expose the data through JDBC.

C.

Trigger a Lambda function when a new log file is added to the bucket to transform and move it to another bucket with an optimized data structure. Use Amazon Athena to query the optimized bucket.

D.

Launch a transient Amazon EMR cluster every night that transforms new log files and loads them into Amazon Redshift.

Question 20

Which of these configuration or deployment practices is a security risk for RDS?

Options:

A.

Storing SQL function code in plaintext

B.

Non-Multi-AZ RDS instance

C.

Having RDS and EC2 instances exist in the same subnet

D.

RDS in a public subnet

Question 21

Which of the following notification endpoints or clients are supported by Amazon Simple

Notification Service? Choose 2 answers

Options:

A.

Email

B.

CloudFront distribution

C.

File Transfer Protocol

D.

Short Message Service

E.

Simple Network Management Protocol

Question 22

You are configuring your company’s application to use Auto Scaling and need to move user state information. Which of the following AWS services provides a shared data store with durability and low latency?

Options:

A.

Amazon Simple Storage Service

B.

Amazon DynamoDB

C.

Amazon EC2 instance storage

D.

AWS ElasticCache Memcached

Question 23

You have launched an Amazon Elastic Compute Cloud (EC2) instance into a public subnet with a primary private IP address assigned, an internet gateway is attached to the VPC, and the public route table is configured to send all internet-based internet. Why is the internet unreachable from this instance?

Options:

A.

The Internet gateway security group must allow all outbound traffic

B.

The instance does not have a public IP address

C.

The instance “Source/Destination check” property must be enabled

D.

The instance security group must allow all inbound traffic

Question 24

A customer has an Amazon S3 bucket. Objects are uploaded simultaneously by a cluster of servers from multiple streams of data. The customer maintains a catalog of objects uploaded in Amazon S3 using an Amazon DynamoDB table. This catalog has the following fields StreamName, TimeStamp, and ServerName, TimeStamp, and ServerName, from which ObjectName can be obtained.

The customer needs to define the catalog to support querying for a given stream or server within a defined time range.

Which DynamoDB table scheme is most efficient to support these queries?

Options:

A.

Define a Primary Key with ServerName as Partition Key and TimeStamp as Sort Key. Don NOT define a Secondary Index or Global Secondary Index.

B.

Define a Primary Key with StreamName as Partition Key and TimeStamp followed by ServerName as Sort Key. Define a Global Secondary Index with ServerName as Partition Key and TimeStamp followed by StreamName.

C.

Define a Primary Key with ServerName as Partition Key. Define a Local Secondary Index with StreamName as Partition Key. Define a Global Secondary Index with TimeStamp as Partition Key.

D.

Define a Primary Key with ServerName as Partition Key. Define a Local Secondary Index with TimeStamp as Partition Key. Define a Global Secondary Index with StreamName as Partition key and TimeStamp as Sort Key.

Question 25

A company needs to deploy services to an AWS region which they not previously used. The company currently has an AWS identity and Access Management (IAM) role for their Amazon EC2 instances, which permits the instance to have access to Amazon DynamoDB. The company wants their EC2 instances in the new region to have the same privileges. How should the company achieve this?

Options:

A.

Create a new IAM role and associated policies within the new region

B.

Assign the existing IAM role to the Amazon EC2 instances in the new region

C.

Copy the IAM role and associated policies to the new region and attach it to the instances

D.

Create the Amazon Machine Image of the instance and copy it to the desired region using the

AMI Copy feature

Question 26

What are the two types of licensing options available for using Amazon RDS for Oracle?

Options:

A.

BYOL and Enterprise License

B.

BYOL and License Included

C.

Enterprise License and License Included

D.

Role based License and License Included

Question 27

REST or Query requests are HTTP or HTTPS requests that use an HTTP verb (such as GET or POST) and a parameter named Action or Operation that specifies the API you are calling.

Options:

A.

FALSE

B.

TRUE

Question 28

You have a video Trans coding application running on Amazon EC2. Each instance pools a queue to find out which video should be Trans coded, and then runs a Trans coding process.

If this process is interrupted, the video will be Trans coded by another instance based on the queuing system. You have a large backlog of videos which need to be Trans coded and would like to reduce this backlog by adding more instances. You will need these instances only until the backlog is reduced. Which type of Amazon EC2 instance should you use to reduce the backlog in the most cost-effective way?

Options:

A.

Dedicated instances

B.

Spot instances

C.

On-demand instances

D.

Reserved instances

Question 29

If your DB instance runs out of storage space or file system resources, its status will change to_____ and your DB Instance will no longer be available.

Options:

A.

storage-overflow

B.

storage-full

C.

storage-exceed

D.

storage-overage

Question 30

A company has reproducible data that they want to store on Amazon Web Services. The company may want to retrieve the data on a frequent basis. Which Amazon web services storage option allows the customer to optimize storage costs and still achieve high availability for their data?

Options:

A.

Amazon S3 Reduced Redundancy Storage

B.

Amazon EBS Magnetic Volume

C.

Amazon Glacier

D.

Amazon S3 Standard Storage

Question 31

An AWS customer is deploying a web application that is composed of a front-end running on Amazon EC2 and of confidential data that is stored on Amazon S3. The customer security policy that all access operations to this sensitive data must be authenticated and authorized by a centralized access management system that is operated by a separate security team. In addition, the web application team that owns and administers the EC2 web front-end instances is prohibited from having any ability to access the data that circumvents this centralized access management system. Which of the following configurations will support these requirements? 

Options:

A.

Encrypt the data on Amazon S3 using a CloudHSM that is operated by the separate security team. Configure the web application to integrate with the CloudHSM for decrypting approved data access operations for trusted end-users.

B.

Configure the web application to authenticate end-users against the centralized access management system. Have the web application provision trusted users STS tokens entitling the download of approved data directly from Amazon S3

C.

Have the separate security team create and IAM role that is entitled to access the data on Amazon S3. Have the web application team provision their instances with this role while denying their IAM users access to the data on Amazon S3

D.

Configure the web application to authenticate end-users against the centralized access management system using SAML. Have the end-users authenticate to IAM using their SAML token and download the approved data directly from S3.

Question 32

Multiple rows in an Amazon Redshift table were accidentally deleted. A System Administrator is restoring the table from the most recent snapshot. The snapshot contains all rows that were in the table before the deletion.

What is the SIMPLEST solution to restore the table without impacting users?

Options:

A.

Restore the snapshot to a new Amazon Redshift cluster, then UNLOAD the table to Amazon S3. In the original cluster, TRUNCATE the table, then load the data from Amazon S3 by using a COPY command.

B.

Use the Restore Table from a Snapshot command and specify a new table name DROP the original table, then RENAME the new table to the original table name.

C.

Restore the snapshot to a new Amazon Redshift cluster. Create a DBLINK between the two clusters in the original cluster, TRUNCATE the destination table, then use an INSERT command to copy the data from the new cluster.

D.

Use the ALTER TABLE REVERT command and specify a time stamp of immediately before the data deletion. Specify the Amazon Resource Name of the snapshot as the SOURCE and use the OVERWRITE REPLACE option.

Question 33

A real-time bidding company is rebuilding their monolithic application and is focusing on serving real-time data. A large number of reads and writes are generated from thousands of concurrent users who follow items and bid on the company’s sale offers.

The company is experiencing high latency during special event spikes, with millions of concurrent users.

The company needs to analyze and aggregate a part of the data in near real time to feed an internal dashboard.

What is the BEST approach for serving and analyzing data, considering the constraint of the row latency on the highly demanded data?

Options:

A.

Use Amazon Aurora with Multi Availability Zone and read replicas. Use Amazon ElastiCache in front of the read replicas to serve read-only content quickly. Use the same database as datasource for the dashboard.

B.

Use Amazon DynamoDB to store real-time data with Amazon DynamoDB. Accelerator to serve content quickly. use Amazon DynamoDB Streams to replay all changes to the table, process and stream to Amazon Elasti search Service with AWS Lambda.

C.

Use Amazon RDS with Multi Availability Zone. Provisioned IOPS EBS volume for storage. Enable up to five read replicas to serve read-only content quickly. Use Amazon EMR with Sqoop to import Amazon RDS data into HDFS for analysis.

D.

Use Amazon Redshift with a DC2 node type and a multi-mode cluster. Create an Amazon EC2 instance with pgpoo1 installed. Create an Amazon ElastiCache cluster and route read requests through pgpoo1, and use Amazon Redshift for analysis.

Question 34

A company needs to monitor the read and write IOPs metrics for their AWS MySQL RDS instances and send real-time alerts to their operations team. Which AWS services can accomplish this?

Choose 2 answers

Options:

A.

Amazon Simple Email Service

B.

Amazon CloudWatch

C.

Amazon Simple Queue Service

D.

Amazon Route 53

E.

Amazon Simple Notification Service

Question 35

Fill in the blanks: A_____ is a storage device that moves data in sequences of bytes or bits (blocks). Hint: These devices support random access and generally use buffered I/O.

Options:

A.

block map

B.

storage block

C.

mapping device

D.

block device

Question 36

Amazon RDS creates an SSL certificate and installs the certificate on the DB Instance when Amazon RDS provisions the instance. These certificates are signed by a certificate authority. The _____ is stored athttps://rds.amazonaws.com/doc/rds-ssl-ca-cert.pem.

Options:

A.

private key

B.

foreign key

C.

public key

D.

protected key

Question 37

A company needs to deploy virtual desktops to its customers in a virtual private cloud, leveraging existing security controls. Which set of AWS services and features will meet the company’s requirements?

Options:

A.

Virtual private network connection, AWS Directory services, and ClassicLink

B.

Virtual private network connection, AWS Directory services, and Amazon WorkSpaces

C.

AWS Directory service, Amazon WorkSpaces, and AWS Identity and Access Management

D.

Amazon Elastic Compute Cloud, and AWS identity and access management

Question 38

Amazon S3 doesn't automatically give a user who creates _____ permission to perform other actions on that bucket or object.

Options:

A.

a file

B.

a bucket or object

C.

a bucket or file

D.

a object or file

Question 39

A data engineer wants to use an Amazon Elastic Map Reduce for an application. The Data engineer needs to make sure it complies with regulatory requirements. The auditor must be able to confirm at any point which servers are running and which network access controls are deployed.

Which action should the data engineer take to meet this requirement?

Options:

A.

Provide the auditor IAM accounts with the SecurityAudit policy attached to their group.

B.

Provide the auditor with SSH keys for access to the Amazon EMR cluster.

C.

Provide the auditor with CloudFormation templates.

D.

Provide the auditor with access the AWS DirectConnect to use their existing tools.

Demo: 39 questions
Total 264 questions