CompTIA DY0-001 - CompTIA DataX Exam
Total 83 questions
Question #1 (Topic: Exam A)
Which of the following issues should a data scientist be most concerned about when generating a synthetic data set?
A. The data set consuming too many resources
B. The data set having insufficient features
C. The data set having insufficient row observations
D. The data set not being representative of the population
Answer: D
Question #2 (Topic: Exam A)
A data scientist is performing a linear regression and wants to construct a model that explains the most variation in the data. Which of the following should the data scientist maximize when evaluating the regression performance metrics?
A. Accuracy
B. R²
C. p value
D. AUC
Answer: B
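Note: R² is the share of variance in the dependent variable that the model explains, computed as 1 - SS_res / SS_tot. A minimal sketch in plain Python follows; the function name `r_squared` and the sample values are illustrative assumptions, not part of the exam material.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    # Residual sum of squares: variation the model fails to explain.
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    # Total sum of squares: variation around the mean of the observations.
    ss_tot = sum((y - mean_y) ** 2 for y in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical observations and predictions, for illustration only.
y_true = [1.0, 2.0, 3.0, 4.0]
y_pred = [1.1, 1.9, 3.2, 3.8]
r2 = r_squared(y_true, y_pred)
print(round(r2, 4))
```

A value close to 1 means the regression accounts for most of the variation, which is why R² is the metric to maximize here.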
Question #3 (Topic: Exam A)
A data scientist is building an inferential model with a single predictor variable. A scatter plot of the independent variable against the real-number dependent variable shows a strong relationship between them. The predictor variable is normally distributed with very few outliers. Which of the following algorithms is the best fit for this model, given the data scientist wants the model to be easily interpreted?
A. A logistic regression
B. An exponential regression
C. A linear regression
D. A probit regression
Answer: C
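Note: part of what makes simple linear regression easy to interpret is that the fit reduces to two numbers, a slope and an intercept. The sketch below uses the closed-form least-squares estimates; the function name `fit_line` and the data are assumptions made for illustration.

```python
def fit_line(x, y):
    """Ordinary least squares for one predictor: returns (slope, intercept)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Slope = sum of cross-deviations / sum of squared deviations of x.
    s_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    s_xx = sum((xi - mean_x) ** 2 for xi in x)
    slope = s_xy / s_xx
    # The fitted line passes through the point of means.
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data lying exactly on y = 2x + 1.
x = [1.0, 2.0, 3.0, 4.0]
y = [3.0, 5.0, 7.0, 9.0]
slope, intercept = fit_line(x, y)
print(slope, intercept)
```

The slope reads directly as the expected change in the dependent variable per one-unit change in the predictor.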
Question #4 (Topic: Exam A)
A data scientist wants to evaluate the performance of various nonlinear models. Which of the following is best suited for this task?
A. AIC
B. Chi-squared test
C. MCC
D. ANOVA
Answer: A
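Note: AIC is defined as 2k - 2 ln(L), where k is the number of estimated parameters and L is the maximized likelihood. It depends only on the likelihood and the parameter count, so it can compare candidate models, nonlinear ones included, when they are fitted to the same data. Lower values are preferred. A minimal sketch, in which the function name `aic` and the log-likelihood values are hypothetical:

```python
def aic(num_params, log_likelihood):
    """Akaike information criterion: 2k - 2 ln(L)."""
    return 2 * num_params - 2 * log_likelihood


# Hypothetical fits of two candidate models to the same data set.
candidates = {
    "model_a": {"num_params": 2, "log_likelihood": -12.0},
    "model_b": {"num_params": 5, "log_likelihood": -10.5},
}

scores = {name: aic(**fit) for name, fit in candidates.items()}
# The model with the lowest AIC is preferred.
best = min(scores, key=scores.get)
print(scores, best)
```

In this made-up comparison the second model fits slightly better, but its extra parameters cost more than the gain in likelihood, so the simpler model has the lower AIC.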
Question #5 (Topic: Exam A)
Which of the following is the layer that is responsible for the depth in deep learning?
A. Convolution
B. Dropout
C. Pooling
D. Hidden
Answer: D