The CAP Theorem states that it is not possible for a distributed computer system to guarantee all three of these?
Answer : B
You have implemented a large Hadoop MapReduce cluster and the applications and users are multiplying. You are now faced with requests for interactive and streaming data applications while you still need to support the original MapReduce batch Jobs. Select the best option for continued support and performance.
Answer : D
For company B, 85% of their analytics queries only involve about 25% of their data; another 10% of the queries will touch 35% of the rest of the data, and only 5% of the queries will touch the remaining 40% of the data. The estimated volume is 50TB growing at
1 TB per year. Which of the following would provide the best value (business benefit) and lowest TCO?
Answer : C
Which of the following statements regarding Big R is TRUE?
Answer : D
Explanation:
References: http://www.computerworld.com/article/2497319/business-intelligence- beginner-s-guide-to-r-syntax-quirks-you-llwant-to-know.html
Which architecture document is used to help organize projects, manage the complexity of the solution, and ensurethat all architecturerequirements have been addressed?
Answer : B
In a typical Hadoop HA cluster, two separate machines are configured as which of the following?
Answer : A
Reference:
http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop- hdfs/HDFSHighAvailabilityWithQJM.html
Which of the following statements regarding Big R is TRUE?
Answer : A
Reference:
https://developer.ibm.com/hadoop/docs/biginsights-value-add/big-r/bigr-tutorial/
A component of IBM Industry Model forms the basis of the Logical Data Warehouse
Model that spans across the traditional RDBMS and Hadoop technology. It defines all of the data structures that would be expected to be defined in the Detailed System of Record.
What is the name of this component?
Answer : A
Explanation:
References:
http://www.ibm.com/support/knowledgecenter/SS9NBR_9.1.0/com.ibm.ima.using/comp/bd m/intro.dita
In designing a new Hadoop system for a customer, the option of using SAN versus DAS was brought up. Which of the following would justify choosing SAN storage?
Answer : D
Which of the following is the section of the Component Model that details how the solution integrates?
Answer : A
You are designing storage for a new Hadoop cluster. Which of the following statements is
TRUE regarding the usage of SAN or NAS?
Answer : A
Explanation:
References: http:// www-01.ibm.com/software/data/infosphere/hadoop/hdfs/
Company A has decided to implement a new data system to support their rapidly growing business. They have an existing 20 TB worth of raw data, with an expected weekly incoming rate of 50 GB of new raw data. The data is mostly text based and unstructured. A typical query can involve pulling in 10 GB of data. Historically, performance has been an issue and currently needs to be addressed. Which of the following would you suggest to support these requirements?
Answer : A
Which of the following is NOT a valid Service Level Agreement (SLA) metric?
Answer : D
Explanation:
References: https://en.wikipedia.org/wiki/Service-level_agreement
A media company collects customer behavior data, such as how frequently they tune in, specific viewing habits, and peak usage in real time, in order to improve their services.The company likes to segmentits customers for advertisers by correlating viewing habits with public data, such as voter registration, in order to launch highly targeted campaigns to specific demographics. What technology should their Data Architect consider?
Answer : D
Reference:
http://www.ibm.com/software/data/puredata/analytics/nztechnology/analytics.html
A large Retailer (online and brick & mortar) processes data for analyzing marketing campaigns for their loyalty club members. The current process takes weeks for processing only 10% of social data. What is the most costeffective platform for processing and analyzing campaign results from social data on a daily basis using 100% dataset?
Answer : B
Explanation:
References: http://www.ibm.com/developerworks/data/library/techarticle/dm-
1110biginsightsintro/