Microsoft Specialist certification as a profession has an incredible evolution over the last few years. Microsoft 70-475 Designing and Implementing Big Data Analytics Solutions exam is the forerunner in validating credentials against. Here are updated Microsoft 70-475 exam questions, which will help you to test the quality features of DumpsSchool exam preparation material completely free. You can purchase the full product once you are satisfied with the product.
Case Study: 1
Relecloud is a social media company that processes hundreds of millions of social media posts per day and sells advertisements to several hundred companies. Relecloud has a Microsoft SQL Server database named DB1 that stores information about the advertisers. DB1 is hosted on a Microsoft Azure virtual machine.
Relecloud has two main offices. The offices we located in San Francisco and New York City.
The offices connected to each other by using a site-to-site VPN. Each office connects directly to the Internet.
Relecloud modifies the pricing of its advertisements based on trending topics. Topics are considered to be trending if they generate many mentions in a specific country during a 15-minute time frame. The highest trending topics generate the highest advertising revenue.
Relecloud wants to deliver reports to the advertisers by using Microsoft Power BI. The reports will provide real-time data on trending topics, current advertising rates, and advertising costs for a given month.
Relecloud will analyze the trending topics data, and then store the data in a new data warehouse for ad-hoc analysis. The data warehouse is expected to grow at a rate of 1 GB per hour or 8.7 terabytes (TB) per year. The data will be retained for five years for the purpose of long term trending.
Management at Relecloud must be able to view which topics are trending to adjust advertising rates in near real-time.
Relecloud plans to implement a new streaming analytics platform that will report on trending topics. Relecloud plans to implement a data warehouse named DB2.
General technical requirements
Relecloud identifies the following technical requirements:
Social media data must be analyzed to identify trending topics in real time.
The use of Infrastructure as a Service (IaaS) platforms must minimize, whenever possible.
The real-time solution used to analyze the social media data must support selling up and down without service interruption.
Technical requirements for advertisers
Relecloud identifies the following technical requirements for the advertisers
The advertisers must be able to see only their own data in the Power BI reports.
The advertisers must authenticate to Power BI by using Azure Active Directory (Azure AD) credentials.
The advertisers must be able to leverage existing Transact-SQL language knowledge when developing the real-time streaming solution.
Members of the internal advertising sales team at Relecloud must be able to see only the sales data of the advertisers to which they are assigned.
The Internal Relecloud advertising sales team must be prevented from inserting, updating, and deleting rows for the advertisers to which they are not assigned.
The internal Relecloud advertising sales team must be able to use a text file to update the list of advertisers and then to upload the file to Azure Blob storage.
Relecloud identifies the following requirements for DB1:
Data generated by the streaming analytics platform must be stored in DB1.
The user names of the advertisers must be mapped to CustomerID in a table named Table2.
The advertisers in DB1 must be stored in a table named Table1 and must be latest nightly.
The user names of the employees at Relecloud must be mapped to EmployeeID in a table named Table3.
Relecloud identifies the following requirements for DB2:
DB2 must have minimal storage costs.
DB2 must run load processes in parallel.
DB2 must support massively parallel processing.
DB2 must be able to store more than 40 TB of data.
DB2 must support scaling up and down, as required.
Data from DB1 must be archived in DB2 for long-term storage.
All of the reports that are executed from DB2 must use aggregation.
Users must be able to pause DB2 when the data warehouse is not in use.
Users must be able to view previous versions of the data in DB2 by using aggregates.
Relecloud identifies the following requirements for extract, transformation, and load (ETL):
Data movement between DB1 and DB2 must occur each hour.
An email alert must be generated when a failure of any type occurs during ETL processing.
You execute the following code for a table named rls_table1.
You use the following code to create Table1.
The following is a sample of the Streaming data.
User Country Topic Time
user1 USA Topic1 2017-01-01T00:00:01.0000000Z
user1 USA Topic3 2017-01-01T00:02:01.0000000Z
user2 Canada Topic2 2017-01-01T00:01:11.0000000Z
user3 India Topic1 2017-01-01T00:03:14.0000000Z
Which technology should you recommend to meet the technical requirement for analyzing the social media data?
A. Azure Stream Analytics
B. Azure Data Lake Analytics
C. Azure Machine Learning
D. Azure HDInsight Storm clusters
You need to implement a solution that meets the data refresh requirement for DB1.
Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.
You need to create a query that identifies the trending topics.
How should you complete the query? To answer, drag the appropriate values to the correct targets. Each value may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content
NOTE: Each correct selection is worth one point.
You implement DB2.
You need to configure the tables in DB2 to host the data from DB1. The solution must meet the requirements for DB2.
Which type of table and history table storage should you use for the table? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.