All that matters here is passing the Microsoft 70-775 exam. All you need is a high score on the 70-775 Perform Data Engineering on Microsoft Azure HDInsight (beta) exam. The only thing you need to do is download the Examcollection 70-775 exam study guides now. We will not let you down, and we back that with our money-back guarantee.
Q11. DRAG DROP
You have a text file named Data/examples/product.txt that contains product information.
You need to create a new Apache Hive table, import the product information to the table, and then read the top 100 rows of the table.
Which four code segments should you use in sequence? To answer, move the appropriate code segments from the list of code segments to the answer area and arrange them in the correct order.
Answer:
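A minimal sketch, in Spark/Scala against a HiveContext, of the kind of four-step sequence this question asks for. The table name productTable, its column layout, and the comma delimiter are assumptions for illustration and are not given in the original question:

// 1. Obtain a HiveContext from the existing SparkContext (sc).
val hiveContext = new org.apache.spark.sql.hive.HiveContext(sc)

// 2. Create the Hive table (the schema here is a hypothetical example).
hiveContext.sql(
  "CREATE TABLE IF NOT EXISTS productTable (productId INT, productName STRING) " +
  "ROW FORMAT DELIMITED FIELDS TERMINATED BY ','")

// 3. Load the product file into the table.
hiveContext.sql(
  "LOAD DATA INPATH 'Data/examples/product.txt' INTO TABLE productTable")

// 4. Read the top 100 rows of the table.
hiveContext.sql("SELECT * FROM productTable LIMIT 100").show()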
Q12. Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
You are implementing a batch processing solution by using Azure HDInsight. You have data stored in Azure.
You need to ensure that you can access the data by using Azure Active Directory (Azure AD) identities.
What should you do?
A. Use a shuffle join in an Apache Hive query that stores the data in a JSON format.
B. Use a broadcast join in an Apache Hive query that stores the data in an ORC format.
C. Increase the number of spark.executor.cores in an Apache Spark job that stores the data in a text format.
D. Increase the number of spark.executor.instances in an Apache Spark job that stores the data in a text format.
E. Decrease the level of parallelism in an Apache Spark job that stores the data in a text format.
F. Use an action in an Apache Oozie workflow that stores the data in a text format.
G. Use an Azure Data Factory linked service that stores the data in Azure Data Lake.
H. Use an Azure Data Factory linked service that stores the data in an Azure DocumentDB database.
Answer: H
Q13. Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
You are building a security tracking solution in Apache Kafka to parse security logs. The security logs record an entry each time a user attempts to access an application. Each log entry contains the IP address used to make the attempt and the country from which the attempt originated.
You need to receive notifications when an IP address from outside of the United States is used to access the application.
Solution: Create two new consumers. Create a file import process to send messages. Start the producer.
Does this meet the goal?
A. Yes
B. No
Answer: B
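As background for this series of questions, a minimal sketch of a Kafka consumer that raises a notification when an access attempt originates outside the United States. The topic name security-logs, the broker address, the consumer group, and the "ipAddress,country" message layout are all assumptions; the sketch also assumes the kafka-clients 2.x+ API:

import java.time.Duration
import java.util.{Arrays, Properties}
import scala.collection.JavaConverters._
import org.apache.kafka.clients.consumer.KafkaConsumer

object NonUsAccessMonitor {
  def main(args: Array[String]): Unit = {
    val props = new Properties()
    props.put("bootstrap.servers", "broker1:9092")   // assumed broker address
    props.put("group.id", "security-monitor")        // assumed consumer group
    props.put("key.deserializer",
      "org.apache.kafka.common.serialization.StringDeserializer")
    props.put("value.deserializer",
      "org.apache.kafka.common.serialization.StringDeserializer")

    val consumer = new KafkaConsumer[String, String](props)
    consumer.subscribe(Arrays.asList("security-logs"))  // assumed topic name

    while (true) {
      val records = consumer.poll(Duration.ofMillis(500))
      for (record <- records.asScala) {
        // Assumed message layout: "ipAddress,country"
        val fields = record.value.split(",")
        if (fields.length == 2 && fields(1).trim != "US")
          println(s"ALERT: access attempt from ${fields(1).trim} (IP ${fields(0).trim})")
      }
    }
  }
}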
Q14. You have an Apache Spark cluster in Azure HDInsight. You plan to join a large table and a lookup table.
You need to minimize data transfers during the join operation. What should you do?
A. Use the reduceByKey function
B. Use a Broadcast variable.
C. Repartition the data.
D. Use the DISK_ONLY storage level.
Answer: B
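A minimal sketch of the broadcast-variable approach, assuming the lookup table is small enough to fit in driver memory. The file paths and the "productId first" record layout are hypothetical; the point is that the lookup table is shipped to every executor once, so the large RDD never has to be shuffled for the join:

// Small lookup table (productId -> productName), collected to the driver.
val lookupMap = sc.textFile("/data/lookup/products.csv")
  .map(_.split(","))
  .map(cols => (cols(0), cols(1)))
  .collectAsMap()

// Ship the lookup table to every executor exactly once.
val lookup = sc.broadcast(lookupMap)

// Large table: each record is assumed to be "productId,quantity".
val sales = sc.textFile("/data/sales/*.csv").map(_.split(","))

// Map-side join: each executor joins locally against the broadcast map,
// so no shuffle of the large RDD is required.
val joined = sales.map(cols =>
  (cols(0), lookup.value.getOrElse(cols(0), "unknown"), cols(1)))

joined.take(10).foreach(println)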
Q15. DRAG DROP
You have a text file named Data/examples/product.txt that contains product information.
You need to create a new Apache Hive table, import the product information to the table, and then read the top 100 rows of the table.
Which four code segments should you use in sequence? To answer, move the appropriate code segments from the list of code segments to the answer area and arrange them in the correct order.
Answer:
Q16. Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
You are building a security tracking solution in Apache Kafka to parse security logs. The security logs record an entry each time a user attempts to access an application. Each log entry contains the IP address used to make the attempt and the country from which the attempt originated.
You need to receive notifications when an IP address from outside of the United States is used to access the application.
Solution: Create a consumer and a broker. Create a file import process to send messages. Run the producer.
Does this meet the goal?
A. Yes
B. No
Answer: A
Q17. You have an Apache Spark cluster in Azure HDInsight. You execute the following command:
%spark
import org.apache.spark.sql.hive.orc._
import org.apache.spark.sql._
What is the result of running the command?
A. The Hive ORC library is imported to Spark, and external tables in ORC format are created.
B. The Spark library is imported, and the data is loaded to an Apache Hive table.
C. The Hive ORC library is imported to Spark, and the ORC-formatted data stored in Apache Hive tables becomes accessible.
D. The Spark library is imported, and Scala functions are executed.
Answer: D
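The two import statements on their own only bring the ORC and Spark SQL classes into scope; nothing is read or created yet. A hedged sketch of how a following notebook cell might then make use of them, assuming a Hive table named productTable already exists (the table name is an assumption):

%spark
// Query an ORC-backed Hive table through a HiveContext and inspect the result.
val hiveContext = new org.apache.spark.sql.hive.HiveContext(sc)
val products = hiveContext.sql("SELECT * FROM productTable")
products.show(10)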
Q18. You have an Apache Spark cluster in Azure HDInsight. You plan to join a large table and a lookup table.
You need to minimize data transfers during the join operation. What should you do?
A. Use the reduceByKey function
B. Use a Broadcast variable.
C. Repartition the data.
D. Use the DISK_ONLY storage level.
Answer: B
Q19. DRAG DROP
You have a text file named Data/examples/product.txt that contains product information.
You need to create a new Apache Hive table, import the product information to the table, and then read the top 100 rows of the table.
Which four code segments should you use in sequence? To answer, move the appropriate code segments from the list of code segments to the answer area and arrange them in the correct order.
Answer:
Q20. Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
You are building a security tracking solution in Apache Kafka to parse security logs. The security logs record an entry each time a user attempts to access an application. Each log entry contains the IP address used to make the attempt and the country from which the attempt originated.
You need to receive notifications when an IP address from outside of the United States is used to access the application.
Solution: Create two new consumers. Create a file import process to send messages. Start the producer.
Does this meet the goal?
A. Yes
B. No
Answer: B