Microsoft 70-773 - Analyzing Big Data with Microsoft R Exam
Page: 1 / 8
Total 39 questions
Question #1 (Topic: )
Note: This question Is part of a series of questions that use the same or similar
answer choice. An answer choice may be correct for more than one question in the
series. Each question is independent of the other questions in this series.
Information and details provided In a question apply only to that question.
You build a model that uses xyz regression.
You need to estimate a model that predicts a binary variable.
Which function should you use?
answer choice. An answer choice may be correct for more than one question in the
series. Each question is independent of the other questions in this series.
Information and details provided In a question apply only to that question.
You build a model that uses xyz regression.
You need to estimate a model that predicts a binary variable.
Which function should you use?
A. rxPredict
B. rxLogit
C. Summary
D. rxLinMod
E. rxTweedie
F. stepAic
G. rxTransform
H. rxDataStep
Answer: B
Question #2 (Topic: )
You are running a parallel function that uses the following R code segment. (Line numbers
are included for reference only.)
[Microsoft-70-773-7.0/Microsoft-70-773-2_2.png]
You need to complete the R code. The solution must support chunking.
are included for reference only.)
[Microsoft-70-773-7.0/Microsoft-70-773-2_2.png]
You need to complete the R code. The solution must support chunking.
A. rxBTrees
B. rxExec
C. rxDForest
D. rxDTree
Answer: C
Question #3 (Topic: )
You have a dataset that has multiple blocks and only numeric variables.
You are computing in a local compute context.
You plan to lag a variable named x to create a new variable named x_lagged by using a
transform function. You will create a new element in the output of the function.
You need to minimize the number of missing values.
Which three actions should you perform? Each correct answer presents part of the
solution.
NOTE: Each correct selection is worth one point.
You are computing in a local compute context.
You plan to lag a variable named x to create a new variable named x_lagged by using a
transform function. You will create a new element in the output of the function.
You need to minimize the number of missing values.
Which three actions should you perform? Each correct answer presents part of the
solution.
NOTE: Each correct selection is worth one point.
A. Assign a value to the first value of x_lagged in the current block.
B. Use rxSet to store the last value of x_lagged in the current block.
C. Use rxSet to store the last value of x in the current block.
D. Use rxGet to retrieve the first value of x in the next block to be processed.
E. Use rxGet to retrieve a value stored in processing of the prior block.
Answer: A,C,D
Question #4 (Topic: )
You need to run a larger data tree model by using rsDForest. The model must use cross
validation.
Which rxDForest option should you use?
validation.
Which rxDForest option should you use?
A. maxSurrogate
B. maxNumBins
C. maxDepth
D. maxCompete
E. xVal
Answer: E
Question #5 (Topic: )
Note: This question is part of a series of questions that use the same scenario. For your
convenience, the scenario is repeated in each question. Each question presents a different
goal and answer choices, but the text of the scenario is exactly the same in each question
in this series.
Start of repeated scenario
You are developing a Microsoft R Open solution that will leverage the computing power of
the database server for some of your datasets.
You are performing feature engineering and data preparation for the datasets.
The following is a sample of the dataset.
[Microsoft-70-773-7.0/Microsoft-70-773-4_2.png]
End of repeated scenario
You need to sort the data from the dataset sample and to remove duplicates by using
wkswork1.
Which R code segment should you use? to answer, select the appropriate options in the
answer area.
Note: Each correct selection is worth one point.
[Microsoft-70-773-7.0/Microsoft-70-773-4_3.png]
convenience, the scenario is repeated in each question. Each question presents a different
goal and answer choices, but the text of the scenario is exactly the same in each question
in this series.
Start of repeated scenario
You are developing a Microsoft R Open solution that will leverage the computing power of
the database server for some of your datasets.
You are performing feature engineering and data preparation for the datasets.
The following is a sample of the dataset.
[Microsoft-70-773-7.0/Microsoft-70-773-4_2.png]
End of repeated scenario
You need to sort the data from the dataset sample and to remove duplicates by using
wkswork1.
Which R code segment should you use? to answer, select the appropriate options in the
answer area.
Note: Each correct selection is worth one point.
[Microsoft-70-773-7.0/Microsoft-70-773-4_3.png]
Answer: [Microsoft-70-773-7.0/Microsoft-70-773-5_2.png]