
Microsoft 70-773 Practice Questions

Analyzing Big Data with Microsoft R exam

Last updated: 2019/07/06. 39 questions in total.

From Heisei to Reiwa: when you purchase the latest 70-773 exam questions, you receive both the Japanese and English versions.

By practicing with actual exam questions, you can learn the key points of the exam and decide whether to register for it.

To save a further 35% of your exam preparation time, use the 70-773 question set.


Question No : 1
You have a dataset that has multiple blocks and only numeric variables.
You are computing in a local compute context.
You plan to lag a variable named x to create a new variable named x_lagged by using a transform function. You will create a new element in the output of the function.
You need to minimize the number of missing values.
Which three actions should you perform? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.

Correct answer:
Explanation:

Question No : 2
You are running a large logistic regression for 1,000 feature variables by using the logisticRegression() function in the MicrosoftML package. All of the predictor variables are numeric.
Currently, you specify the input variables separately by using the following formula.



You discover that it takes 20 minutes to estimate each model.
You need to reduce the amount of time required to estimate each model without losing any information in the predictors.
What should you do?

Correct answer:

Question No : 3
You have a dataset that has a character variable.
You need to create a bag of counts of n-grams.
Which function should you use?

Correct answer:
Explanation:
featurizeText: Produces a bag of counts of n-grams (sequences of consecutive words) from a given corpus of text. It offers language detection, tokenization, stop-word removal, text normalization, and feature generation.
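
To make "a bag of counts of n-grams" concrete, here is a minimal sketch in plain Python. It is not how featurizeText in MicrosoftML is implemented; the helper name `ngram_counts`, the whitespace tokenizer, and the lowercasing step are assumptions chosen only for illustration.

```python
from collections import Counter


def ngram_counts(text, n=2):
    """Count sequences of n consecutive words in text (hypothetical helper)."""
    # Assumption: lowercase and split on whitespace as a crude tokenizer.
    tokens = text.lower().split()
    # Slide a window of n tokens across the token list and join each window.
    grams = (" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1))
    return Counter(grams)


counts = ngram_counts("the cat sat on the mat the cat ran", n=2)
print(counts["the cat"])
```

Each distinct n-gram becomes one feature whose value is its count. A real text featurizer would typically also normalize the text and drop stop words before counting.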

Question No : 4
DRAG DROP
You need to set the compute context for three different target environments.
Which statement should you use for each environment? To answer, drag the appropriate statements to the correct execution contexts. Each statement may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.



Correct answer:

Question No : 5
You plan to analyze data on a local computer. To improve performance, you plan to alternate the operation between a Microsoft SQL Server and the local computer.
You need to run complex code on the SQL Server, and then revert to the local compute context.
Which R code segment should you use?



Correct answer:

Question No : 6
You need to use ScaleR distributed processing in an Apache Hadoop environment.
Which data source should you use?

Correct answer:

Question No : 7
You are planning the compute contexts for your environment.
You need to execute rx-function calls in parallel.
What are three possible compute contexts that you can use to achieve this goal? Each correct answer presents a complete solution.
NOTE: Each correct selection is worth one point.

Correct answer:
Explanation:
https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-hadoop-r-server-compute-contexts

Question No : 8
You have cloud and on-premises resources that include Microsoft SQL Server and a big data environment in Apache Hadoop.
You have 50 billion fact records.
You need to build time series models to execute forecasting reports on the fact records.
What should you use?

Correct answer:

Question No : 9
You have a slow MapReduce job.
You need to optimize the job to control the number of mapper and reducer tasks.
Which function should you use?

Correct answer:

Question No : 10
You have an Apache Hadoop Hive data warehouse. RevoScaleR is not installed.
You need to sort the data according to the variables in the dataset.
What should you do?

Correct answer:

Question No : 11
Note: This question is part of a series of questions that use the same scenario. For your convenience, the scenario is repeated in each question. Each question presents a different goal and answer choices, but the text of the scenario is exactly the same in each question in this series.
Start of repeated scenario
You are developing a Microsoft R Open solution that will leverage the computing power of the database server for some of your datasets.
You are performing feature engineering and data preparation for the datasets.
The following is a sample of the dataset.



End of repeated scenario
You have the following R code.



Which function determines the variable?

Correct answer:

Question No : 12
Note: This question is part of a series of questions that use the same scenario. For your convenience, the scenario is repeated in each question. Each question presents a different goal and answer choices, but the text of the scenario is exactly the same in each question in this series.
Start of repeated scenario
You are developing a Microsoft R Open solution that will leverage the computing power of the database server for some of your datasets.
You are performing feature engineering and data preparation for the datasets.
The following is a sample of the dataset.



End of repeated scenario
You need to analyze the dataset without the missing values. The solution must not remove the missing values from the dataset.
Which R code segment should you use?



Correct answer:

Question No : 13
HOTSPOT
Note: This question is part of a series of questions that use the same scenario. For your convenience, the scenario is repeated in each question. Each question presents a different goal and answer choices, but the text of the scenario is exactly the same in each question in this series.
Start of repeated scenario
You are developing a Microsoft R Open solution that will leverage the computing power of the database server for some of your datasets.
You are performing feature engineering and data preparation for the datasets.
The following is a sample of the dataset.



End of repeated scenario
You need to sort the data from the dataset sample and remove duplicates by using wkswork1.
Which R code segment should you use? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.



Correct answer:

Question No : 14
HOTSPOT
Note: This question is part of a series of questions that use the same scenario. For your convenience, the scenario is repeated in each question. Each question presents a different goal and answer choices, but the text of the scenario is exactly the same in each question in this series.
Start of repeated scenario
You are developing a Microsoft R Open solution that will leverage the computing power of the database server for some of your datasets.
You are performing feature engineering and data preparation for the datasets.
The following is a sample of the dataset.



End of repeated scenario
You plan to score some data to create data features to address empty rows.
You have the following R code.



You need to transform the data and overwrite the current dataset.
Which R code segment should you use? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.



Correct answer:

Question No : 15
Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.
You need to calculate a measure of central tendency and variability for the variables in a dataset that is grouped by using another categorical variable.
What should you use?

Correct answer:
