March Sale Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 70percent

IBM C2090-102 IBM Big Data Architect Exam Practice Test

Demo: 16 questions
Total 110 questions

IBM Big Data Architect Questions and Answers

Question 1

Which of the following statements regarding Big R is TRUE?

Options:

A.

Missing data values must be handled by ETL processes prior to analyzing data with Big R

B.

A bigr.frame loads data in memory for optimal performance

C.

A Big R user is responsible for parallelizing the execution of the R functions being used in the R program

D.

Performing a mathematical operation on a Big R vector variable will automatically loop through each item inthe vector

Question 2

Which of the following statements regarding Big R is TRUE?

Options:

A.

Unless specified otherwise, Big R automatically assumes all data to be integers

B.

Big R’s ‘bigr.frame’ is equivalent to R’s ‘data.frames’

C.

When you execute Big R “apply” function, Big R transparently extracts data out of HDFS into the Big R engine

D.

A data analyst using Big R employs MapReduce programming principles

Question 3

It’s helpful to look at the characteristics of big data along certain lines – for example, how the data is collected, analyzed and processed. There are many characteristics to consider. Which one of the following is NOT a characteristic that should be considered?

Options:

A.

Data frequency and size

B.

Software

C.

Data source

D.

Processing methodology

Question 4

Which of the following is a requirement for data retention and archival?

Options:

A.

A format and storage repository for archived data

B.

Public cloud

C.

Hosting location

D.

Solid-state technology

Question 5

A large Retailer (online and “brick & mortar”) processes data for analyzing marketing campaigns for their loyalty club members. The current process takes weeks for processing only 10% of social data. What is the most costeffective platform for processing and analyzing campaign results from social data on a daily basis using 100% dataset?

Options:

A.

Enterprise Data Warehouse

B.

BigInsights Open Data Platform

C.

High Speed Mainfraime Processing

D.

In Memory Computing

Question 6

The NameNode determines the rack id to which each DataNode belongs via the process outlined in which of the following?

Options:

A.

HDFS APIs

B.

Hadoop Rack Awareness

C.

Job Tracker

D.

NameNode High Availability

Question 7

Company K is designing their Big Data system. In their enterprise, they anticipate every 9 months there will be a big spike of new data on the order of multiple TB. Their company policy also dictates that data older than one year will be archived with a major clean up every 5 years. Cost is also a big issue. Which of the following provides the best design for these requirements?

Options:

A.

Estimate the peak volume over a 5 year period and set up a Hadoop system with commodity HW andstorage to accommodate that volume

B.

Estimate the peak volume over a 3 year period and set up a Hadoop system with NAS to accommodate theexpected volume

C.

Use Cloud elasticity capabilities to handle the peak and valley data volume

D.

Use SAN storage with compression to handle the peak and valley data volume

Question 8

A media company wants to enhance their subscription services to their end customers by incorporating Twitter, Wikis, Blogs, and Analysts data. The company likes to provide a real-time dashboard view for news articles, press releases, and analysts perspective to end customers based on their subscriptions. What solution blueprint would offer the best fit for this critical business requirement?

Options:

A.

Watson Analytics

B.

Big Insights, Spark, and Big SQL

C.

Pure Data for Analytics PDA, and SPSS

D.

InfoSphere Streams, BigInsights, and Watson Explorer

Question 9

Company A is searching for a browser-based visualization tool to perform analysis on vast amounts of data in any structure. They want to execute operations such as pivot, slice and dice, among others. Which of the following would meet these requirements?

Options:

A.

Streams

B.

BigSheets

C.

Aginity Workbench

D.

Watson Explorer

Question 10

Faced with a wide area network implementation, you have a need for asynchronous remote updates. Which one of the following would best address this use case?

Options:

A.

GPFS Active File Management allows data access and modifications even when remote storage cluster is unavailable

B.

HDFS Cluster rebalancing is compatible with data rebalancing schemes. A scheme might automatically move data from one DataNode to another if the free space on a DataNode falls below a certain threshold

C.

GPFS File clones can be created from a regular file or a file in a snapshot using the mmclone command

D.

HDFS NameNode The NameNode keeps an image of the entire file system namespace and file Blockmap in memory. This key metadata item is designed to be compact, such that a NameNode with 4 GB of RAM is plenty to support a huge number of files and directories

Question 11

Which one of the following statements is TRUE?

Options:

A.

Big SQL uses Hadoop MR framework to process query tasks in parallel

B.

Big SQL executes queries locally on Big SQL server single node on a multi node cluster

C.

Big SQL can process queries in parallel and executes queries locally

D.

Big SQL only works with HDFS

Question 12

Which of the following is NOT a valid Service Level Agreement (SLA) metric?

Options:

A.

Mean time between failures

B.

Mean time to repair

C.

Identification to responsible party

D.

Identification of failing component

Question 13

What is used to capture client requirements for software selection and to evaluate the initial functional “fit” of a vendor’s software solution to the business needs of the client?

Options:

A.

Operational Model

B.

Requirements Matrix

C.

Viability Assessment

D.

Use Case Model

Question 14

Which of the following is the section of the Component Model that details how the solution integrates?

Options:

A.

Component Relationship Diagram

B.

Component Interface Diagram

C.

Component Interaction Diagram

D.

Component Reaction Diagram

Question 15

BigInsights is a solution that accomplishes which of the following?

Options:

A.

Replaces the traditional Data warehouses

B.

Can exchange information with the traditional Data warehouses only

C.

Includes a connector that enables data exchange between a BigInsights cluster and Netezza appliance in only one way

D.

Supports data exchange with a number of sources

Question 16

Which of the following statements is TRUE?

Options:

A.

A good use of the BigInsights is as a query-ready archival system for your data warehouse to quickly accessdata

B.

BigInsights and Hadoop based systems in general are best used in high concurrency transactional systems

C.

Hadoop map reduce based computing engine is ideal for use as a real-time or near real-time processingextension to your existing business intelligence reporting

D.

The system ML engine is the preferable option to add unstructured text sentiment analysis to the customerservice reporting system

Demo: 16 questions
Total 110 questions