getcertified4sure.com

What Avant-garde 70-773 Is?




We provide guaranteed Microsoft 70-773 exam material, which is the best preparation for passing the 70-773 test and getting certified by Microsoft in Analyzing Big Data with Microsoft R (beta). The 70-773 Questions & Answers cover all the knowledge points of the real 70-773 exam. Pass your Microsoft 70-773 exam with the latest dumps, guaranteed!

Q1. Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.

You have a data source that is larger than memory.

You need to visualize the distribution of the values for a variable in the data source. What should you use?

A. the Describe package

B. the rxHistogram function

C. the rxSummary function

D. the rxQuantile function

E. the rxCube function

F. the summary function

G. the rxCrossTabs function

H. the ggplot2 package

Answer: B
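
Why a chunk-aware histogram function fits here: a histogram only needs bin counts, and bin counts can be accumulated one chunk at a time, so the full data never has to sit in memory. The Python sketch below illustrates that idea only. It is not how rxHistogram is implemented; the function names, the bin range, and the sample data are all hypothetical.

```python
from typing import Iterable, Iterator, List


def read_in_chunks(values: Iterable[float], chunk_size: int) -> Iterator[List[float]]:
    """Yield successive chunks so only one chunk is held in memory at a time."""
    chunk: List[float] = []
    for value in values:
        chunk.append(value)
        if len(chunk) == chunk_size:
            yield chunk
            chunk = []
    if chunk:
        yield chunk  # final, possibly shorter, chunk


def chunked_histogram(chunks: Iterable[List[float]], low: float, high: float,
                      n_bins: int) -> List[int]:
    """Accumulate counts over fixed, equal-width bins on [low, high], chunk by chunk."""
    counts = [0] * n_bins
    width = (high - low) / n_bins
    for chunk in chunks:
        for value in chunk:
            if value < low or value > high:
                continue  # this sketch simply ignores out-of-range values
            # min(...) places value == high in the last bin
            index = min(int((value - low) / width), n_bins - 1)
            counts[index] += 1
    return counts


# Hypothetical data: a generator stands in for a source too large to load at once.
data = (i % 10 for i in range(1000))
counts = chunked_histogram(read_in_chunks(data, chunk_size=64), low=0, high=10, n_bins=5)
print(counts)
```

The bin edges are fixed before the pass starts, which is what lets the counts from separate chunks be added together.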


Q2. You have cloud and on-premises resources that include Microsoft SQL Server and a big data environment in Apache Hadoop.

You have 50 billion fact records.

You need to build time series models to execute forecasting reports on the fact records. What should you use?

A. RxSpark on the Hadoop cluster

B. RxHadoopMR on the Hadoop cluster

C. RxLocalSeq on the SQL Server database

D. RxLocalParallel on the SQL Server database

Answer: A


Q3. You have the following regression forest.

 

Which variable contributes the most to the dependent variable?

A. stack.loss

B. Water.Temp

C. Air.Flow

D. Acid.Conc

Answer: A


Q4. You have one-class support vector machines (SVMs).

You have a large dataset, but you do not have enough training time to fully test the model. What is an alternative method to validate the model?

A. Use Principal Components Analysis (PCA)-Based Anomaly Detection.

B. Replace the SVMs with two-class SVMs.

C. Perform feature selection.

D. Use outlier detection.

Answer: A


Q5. DRAG DROP

You need to set the compute context for three different target environments.

Which statement should you use for each environment? To answer, drag the appropriate statements to the correct execution contexts. Each statement may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.

NOTE: Each correct selection is worth one point.

 

Answer:

 


Q6. You have a dataset that has multiple blocks and only numeric variables. You are computing in a local compute context.

You plan to lag a variable named x to create a new variable named x_lagged by using a transform function. You will create a new element in the output of the function.

You need to minimize the number of missing values.

Which three actions should you perform? Each correct answer presents part of the solution.

NOTE: Each correct selection is worth one point.

A. Assign a value to the first value of x_lagged in the current block.

B. Use rxSet to store the last value of x_lagged in the current block.

C. Use rxSet to store the last value of x in the current block.

D. Use rxGet to retrieve the first value of x in the next block to be processed.

E. Use rxGet to retrieve a value stored in processing of the prior block.

Answer: A,C,E
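
The block-boundary problem in this question can be sketched outside R. When data is processed block by block, the first row of each block has no earlier row inside that block, so a naive lag leaves one missing value per block. Storing the last value of x from one block and using it as the first lagged value of the block processed after it leaves only the very first row missing. The Python sketch below illustrates that carry-over; the `state` dictionary is a hypothetical stand-in for the store/retrieve role that the rxSet and rxGet calls play in the options, and the function name and sample blocks are made up.

```python
from typing import Dict, Iterable, List, Optional


def lag_in_blocks(blocks: Iterable[List[float]]) -> List[List[Optional[float]]]:
    """Lag x by one row across blocks, carrying the last value between blocks."""
    # Hypothetical stand-in for state shared between block-wise transform calls.
    state: Dict[str, Optional[float]] = {"last_x": None}
    lagged_blocks: List[List[Optional[float]]] = []
    for x in blocks:
        if not x:
            lagged_blocks.append([])  # nothing to lag, nothing to store
            continue
        # First lagged value comes from the previously processed block (None at start).
        x_lagged = [state["last_x"]] + x[:-1]
        # Store the last value of x for the block processed next.
        state["last_x"] = x[-1]
        lagged_blocks.append(x_lagged)
    return lagged_blocks


# Hypothetical blocks of a numeric variable x.
blocks = [[1.0, 2.0, 3.0], [4.0, 5.0], [6.0, 7.0, 8.0]]
result = lag_in_blocks(blocks)
print(result)
```

Only the first row of the first block ends up missing, because no earlier value exists for it.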


Q9. You are running a parallel function that uses the following R code segment. (Line numbers are included for reference only.)

 

You need to complete the R code. The solution must support chunking. Which function should you insert at line 02?

A. rxBTrees

B. rxExec

C. rxDForest

D. rxDTree

Answer: C


Q10. Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution. After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.

You use dplyrXdf and you discover that after you exit the session, the output files that were created were deleted. You need to prevent the files from being deleted.

Solution: You use dplyrXdf with the persist verb.

Does this meet the goal?

A. Yes

B. No

Answer: A