It is faster and easier to pass the Microsoft 70-773 exam by using certified Microsoft Analyzing Big Data with Microsoft R (beta) questions and answers. Get immediate access to the leading 70-773 exam material, find 70-773 questions covering the same core areas with professionally verified answers, and pass your exam with a high score.
Q11. You have a dataset that has multiple blocks and only numeric variables. You are computing in a local compute context.
You plan to lag a variable named x to create a new variable named x_lagged by using a transform function. You will create a new element in the output of the function.
You need to minimize the number of missing values.
Which three actions should you perform? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.
A. Assign a value to the first value of x_lagged in the current block.
B. Use rxSet to store the last value of x_lagged in the current block.
C. Use rxSet to store the last value of x in the current block.
D. Use rxGet to retrieve the first value of x in the next block to be processed.
E. Use rxGet to retrieve a value stored in processing of the prior block.
Answer: A,C,E
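The block-to-block carry-over behind this question can be sketched in plain Python. This is only a simulation of the idea, not RevoScaleR code: the `state` dictionary is a hypothetical stand-in for the store that rxSet and rxGet work with, and `lag_in_blocks` is an illustrative name. Each block saves its last value of x, and the following block uses that saved value as the first value of x_lagged, so only the very first row of the dataset stays missing.

```python
def lag_in_blocks(blocks):
    """Lag x by one row across blocks, carrying one value between blocks."""
    state = {}   # hypothetical stand-in for the rxSet/rxGet store
    out = []
    for block in blocks:
        x = block["x"]
        if not x:
            # Empty block: nothing to lag, and the stored value is kept as-is.
            out.append({"x": [], "x_lagged": []})
            continue
        # "rxGet": value stored while processing the prior block
        # (None for the first block, which has no predecessor).
        carried = state.get("last_x")
        # First x_lagged value comes from the prior block; the rest shift by one.
        x_lagged = [carried] + x[:-1]
        # "rxSet": store the last value of x for the next block.
        state["last_x"] = x[-1]
        out.append({"x": x, "x_lagged": x_lagged})
    return out


blocks = [{"x": [1, 2, 3]}, {"x": [4, 5]}, {"x": [6, 7, 8]}]
result = lag_in_blocks(blocks)
for block in result:
    print(block["x_lagged"])
```

Without the carried value, every block would start with a missing x_lagged, giving one missing value per block instead of one for the whole dataset.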
Q12. You are running a parallel function that uses the following R code segment. (Line numbers are included for reference only.)
You need to complete the R code. The solution must support chunking. Which function should you insert at line 02?
A. rxBTrees
B. rxExec
C. rxDForest
D. rxDTree
Answer: C
Q13. You have the following regression forest.
Which variable contributes the most to the dependent variable?
A. stack.loss
B. Water.Temp
C. Air.Flow
D. Acid.Conc
Answer: A
Q14. You have one-class support vector machines (SVMs).
You have a large dataset, but you do not have enough training time to fully test the model. What is an alternative method to validate the model?
A. Use Principal Component Analysis (PCA)-based anomaly detection.
B. Replace the SVMs with two-class SVMs.
C. Perform feature selection.
D. Use outlier detection.
Answer: A
Q15. HOTSPOT
Note: This question is part of a series of questions that use the same scenario. For your convenience, the scenario is repeated in each question. Each question presents a different goal and answer choices, but the text of the scenario is exactly the same in each question in this series.
Start of repeated scenario
You are developing a Microsoft R Open solution that will leverage the computing power of the database server for some of your datasets.
You are performing feature engineering and data preparation for the datasets. The following is a sample of the dataset.
End of repeated scenario
You need to sort the data from the dataset sample and remove duplicates by using wkswork1.
Which R code segment should you use? To answer, select the appropriate options in the answer area.
Note: Each correct selection is worth one point.
Answer:
Q16. You are running a large logistic regression for 1,000 feature variables by using the logisticRegression() function in the MicrosoftML package. All of the predictor variables are numeric.
Currently, you specify the input variables separately by using the following formula.
You discover that it takes 20 minutes to estimate each model.
You need to reduce the amount of time required to estimate each model without losing any information in the predictors.
What should you do?
A. Use stepControl() to perform stepwise regression to limit the number of variables that contribute to the model.
B. Use selectFeatures() to select the features that provide the most information about the outcome variable.
C. Use princomp() on the correlation matrix of Features, and then use only the first 100 principal components to reduce the number of input variables.
D. Use concat() to create a single array variable named Features, and then specify a new formula named Outcome ~ Features.
Answer: D
Q19. Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution. After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You use dplyrXdf and you discover that after you exit the session, the output files that were created were deleted. You need to prevent the files from being deleted.
Solution: You use dplyrXdf with the outFile parameter and specify a path other than the working directory for dplyrXdf.
Does this meet the goal?
A. Yes
B. No
Answer: A
Q20. Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution. After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You have a Microsoft SQL Server instance that has R Services (In-Database) installed. You need to monitor the R jobs that are sent to SQL Server.
Solution: You create an events trace configuration file and place the file in the same directory as the BxlServer process.
Does this meet the goal?
A. Yes
B. No
Answer: B