IBM C2090-102 IBM Big Data Architect Exam Practice Test

Demo: 16 questions
Total 110 questions

IBM Big Data Architect Questions and Answers

Question 1

Which of the following statements regarding Big R is TRUE?

Options:

Missing data values must be handled by ETL processes prior to analyzing data with Big R

A bigr.frame loads data in memory for optimal performance

A Big R user is responsible for parallelizing the execution of the R functions being used in the R program

Performing a mathematical operation on a Big R vector variable will automatically loop through each item in the vector

Question 2

Which of the following statements regarding Big R is TRUE?

Options:

Unless specified otherwise, Big R automatically assumes all data to be integers

Big R’s ‘bigr.frame’ is equivalent to R’s ‘data.frames’

When you execute a Big R “apply” function, Big R transparently extracts data out of HDFS into the Big R engine

A data analyst using Big R employs MapReduce programming principles

Question 3

It’s helpful to look at the characteristics of big data along certain lines – for example, how the data is collected, analyzed and processed. There are many characteristics to consider. Which one of the following is NOT a characteristic that should be considered?

Options:

Data frequency and size

Software

Data source

Processing methodology

Question 4

Which of the following is a requirement for data retention and archival?

Options:

A format and storage repository for archived data

Public cloud

Hosting location

Solid-state technology

Question 5

A large retailer (online and “brick & mortar”) processes data to analyze marketing campaigns for its loyalty club members. The current process takes weeks to process only 10% of the social data. What is the most cost-effective platform for processing and analyzing campaign results from social data on a daily basis using 100% of the dataset?

Options:

Enterprise Data Warehouse

BigInsights Open Data Platform

High Speed Mainframe Processing

In Memory Computing

Question 6

The NameNode determines the rack id to which each DataNode belongs via the process outlined in which of the following?

Options:

HDFS APIs

Hadoop Rack Awareness

Job Tracker

NameNode High Availability

Question 7

Company K is designing their Big Data system. In their enterprise, they anticipate every 9 months there will be a big spike of new data on the order of multiple TB. Their company policy also dictates that data older than one year will be archived with a major clean up every 5 years. Cost is also a big issue. Which of the following provides the best design for these requirements?

Options:

Estimate the peak volume over a 5 year period and set up a Hadoop system with commodity HW and storage to accommodate that volume

Estimate the peak volume over a 3 year period and set up a Hadoop system with NAS to accommodate the expected volume

Use Cloud elasticity capabilities to handle the peak and valley data volume

Use SAN storage with compression to handle the peak and valley data volume

Question 8

A media company wants to enhance its subscription services for end customers by incorporating data from Twitter, wikis, blogs, and analysts. The company would like to provide a real-time dashboard view of news articles, press releases, and analysts’ perspectives to end customers based on their subscriptions. What solution blueprint would offer the best fit for this critical business requirement?

Options:

Watson Analytics

BigInsights, Spark, and Big SQL

PureData for Analytics (PDA) and SPSS

InfoSphere Streams, BigInsights, and Watson Explorer

Question 9

Company A is searching for a browser-based visualization tool to perform analysis on vast amounts of data in any structure. They want to execute operations such as pivot, slice and dice, among others. Which of the following would meet these requirements?

Options:

Streams

BigSheets

Aginity Workbench

Watson Explorer

Question 10

Faced with a wide area network implementation, you have a need for asynchronous remote updates. Which one of the following would best address this use case?

Options:

GPFS Active File Management allows data access and modifications even when the remote storage cluster is unavailable

HDFS Cluster rebalancing is compatible with data rebalancing schemes. A scheme might automatically move data from one DataNode to another if the free space on a DataNode falls below a certain threshold

GPFS File clones can be created from a regular file or a file in a snapshot using the mmclone command

HDFS NameNode: The NameNode keeps an image of the entire file system namespace and file Blockmap in memory. This key metadata item is designed to be compact, such that a NameNode with 4 GB of RAM is plenty to support a huge number of files and directories

Question 11

Which one of the following statements is TRUE?

Options:

Big SQL uses the Hadoop MR framework to process query tasks in parallel

Big SQL executes queries locally on a single Big SQL server node in a multi-node cluster

Big SQL can process queries in parallel and executes queries locally

Big SQL only works with HDFS

Question 12

Which of the following is NOT a valid Service Level Agreement (SLA) metric?

Options:

Mean time between failures

Mean time to repair

Identification of responsible party

Identification of failing component

Question 13

What is used to capture client requirements for software selection and to evaluate the initial functional “fit” of a vendor’s software solution to the business needs of the client?

Options:

Operational Model

Requirements Matrix

Viability Assessment

Use Case Model

Question 14

Which of the following is the section of the Component Model that details how the solution integrates?

Options:

Component Relationship Diagram

Component Interface Diagram

Component Interaction Diagram

Component Reaction Diagram

Question 15

BigInsights is a solution that accomplishes which of the following?

Options:

Replaces the traditional Data warehouses

Can exchange information with the traditional Data warehouses only

Includes a connector that enables data exchange between a BigInsights cluster and Netezza appliance in only one way

Supports data exchange with a number of sources

Question 16

Which of the following statements is TRUE?

Options:

A good use of BigInsights is as a query-ready archival system for your data warehouse to quickly access data

BigInsights and Hadoop-based systems in general are best used in high-concurrency transactional systems

The Hadoop MapReduce-based computing engine is ideal for use as a real-time or near real-time processing extension to your existing business intelligence reporting

The SystemML engine is the preferable option to add unstructured text sentiment analysis to the customer service reporting system
