Next, I'm sharing the answers for the Cloud Career Pathway – Data Integration Specialist section.
Big Data
1) True or False.
Big data is all about collecting, storing, processing, and visualizing massive amounts of data.
Group of answer choices
- True
- False
2) True or False.
Hadoop splits your data (files) into multiple chunks and assigns a single map task to each part.
Group of answer choices
- True
- False
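The split-then-map idea in this question can be sketched in plain Python. This is only a toy illustration, not the Hadoop API: the function names (split_into_chunks, map_task, reduce_counts), the chunk size, and the sample lines are all hypothetical.

```python
from collections import Counter


def split_into_chunks(lines, chunk_size):
    """Cut the input into fixed-size chunks (a stand-in for input splits)."""
    return [lines[i:i + chunk_size] for i in range(0, len(lines), chunk_size)]


def map_task(chunk):
    """One map task per chunk: count the words seen in that chunk only."""
    counts = Counter()
    for line in chunk:
        counts.update(line.split())
    return counts


def reduce_counts(partials):
    """Merge the per-chunk counts into a single result."""
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


# Hypothetical sample input.
lines = ["big data", "data lake", "big cluster", "data"]
chunks = split_into_chunks(lines, 2)
partials = [map_task(chunk) for chunk in chunks]  # one map task per chunk
totals = reduce_counts(partials)
print(dict(totals))
```

Each chunk is processed independently, which is what lets a real cluster run the map tasks in parallel on different nodes.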
3) What is the port number for Task Tracker?
Group of answer choices
- 50070
- 50050
- 50040
- 50060
4) Which of the following are scenarios for using Amazon Kinesis Streams for analytics? (Select all that apply)
Group of answer choices
- Log and data feed intake and processing
- Real-time metrics and reporting
- Data retrieval and storage
- Real-time data analytics
5) What type of calculations does Amazon QuickSight enable? (Select all that apply)
Group of answer choices
- set calculations as unions and intersect
- relational and logical calculations
- minimum, maximum and median
- sum, count and average
6) What is distributed cache?
Group of answer choices
- The distributed cache is a special component on the data node that will cache frequently used data for faster client response. It is used during the map step.
- The distributed cache is a special component on the name node that will cache frequently used data for faster client response. It is used during the reduce step.
- The distributed cache is a component that allows developers to deploy jars for Map-Reduce processing.
- The distributed cache is a component that caches java objects.
7) Which AWS service is a managed cluster platform that simplifies running big data frameworks, such as Apache Hadoop and Apache Spark, on AWS to process and analyze vast amounts of data?
Group of answer choices
- Amazon Machine Learning
- Amazon EMR (Amazon EMR)
- Amazon QuickSight
- Amazon Kinesis Streams
8) Which AWS service allows you to build Amazon Kinesis applications and stream data to power real-time dashboards, generate alerts, and implement dynamic pricing and advertising?
Group of answer choices
- Amazon Elastic MapReduce
- AWS Lambda
- Amazon Kinesis Client Library
- Amazon Redshift
9) What is the port number for Job Tracker?
Group of answer choices
- 50040
- 50030
- 50050
- 50020
10) The Hadoop framework works on which of the following two core components? (Select all that apply)
Group of answer choices
- Unstructured Data Files
- Hadoop MapReduce
- Hadoop Distributed File System
- Structured Data Files
11) Which of the following is NOT a data aggregation best practice?
Group of answer choices
- Aggregated Data Size
- Data Compression Algorithms
- Fragment data
- Data Partitioning
12) ____________ is a standard language for accessing and manipulating databases.
Group of answer choices
- SQL
- JavaScript
- HTML
- CSS
13) What types of visualizations are NOT supported by Amazon QuickSight? Select all that apply.
Group of answer choices
- Box plots
- Aggregation
- Comparison and distribution
- Correlation
14) Which AWS service is most useful wherever there is a need to move data rapidly off producers (data sources) and continuously process it?
Group of answer choices
- AWS Lambda
- Amazon Kinesis Streams
- Amazon DynamoDB
- Amazon QuickSight
Data and Databases
1) A ____________ is a collection of related data entries and it consists of columns and rows.
Group of answer choices
- table
- data log
- cluster
- stack
2) How are Reserved Instances different from On-Demand DB Instances?
Group of answer choices
- CPU Resources
- Storage Capacity
- Memory Resources
- Billing Method
3) What does SQL stand for?
Group of answer choices
- Standard Query Language
- Strong Question Language
- Structured Question Language
- Structured Query Language
4) Using the scenario below order each task as Step 1, Step 2, etc. to complete the scenario activity.
Scenario: Create a NoSQL Table using Amazon DynamoDB.
Group of answer choices
- In the Table name field, type the name of your data file. -> Step 2
- Type a data attribute in the Partition Key field to spread data across partitions for scalability. -> Step 3
- In the DynamoDB console, click Create Table. -> Step 1
- Check the Add sort key box. Type a data attribute in the Sort Key field to enable easy sorting. -> Step 4
- Click Create. When the created table is ready to use, it appears in the table list with a checkbox. -> Step 5
Software Development
1) ____________ is defined as the set of activities that revolve around a new software product, from its inception to when the product matures and perhaps retires.
Group of answer choices
- agile deployment
- lifecycle management
- release management
- integration testing
2) _______________ is a DevOps software development practice where code changes are automatically built, tested, and prepared for a release to production.
Group of answer choices
- Beanstalk
- Continuous Delivery
- Drupal
- Continuous Integration
3) Which of the following is NOT a General CMake Variable?
Group of answer choices
- BUILD_ONLY
- TARGET_HOME
- REGENERATE_CLIENTS
- ADD_CUSTOM_CLIENTS
4) Which of the following PHP variable types has only one value: NULL?
Group of answer choices
- Null
- Doubles
- Booleans
- Strings
5) JavaScript statements often start with a keyword to identify the JavaScript action to be performed. Which of the following is NOT a JavaScript keyword?
Group of answer choices
- Debugger
- Write
- Break
- Function
6) What is fault-tolerance?
Group of answer choices
- A block of code designed to perform a particular task
- A special variable, which can hold more than one value at a time
- The ability for a system to remain in operation even if some of the components used to build the system fail
- Used to perform arithmetic on numbers (literals or variables)
7) ______________ provides the information required to launch an instance, which is a virtual server in the cloud.
Group of answer choices
- Index file
- Application Programming Interface (API)
- Amazon Machine Image (AMI)
- CloudFront
8) AWS provides libraries, sample code, tutorials, and other resources for software developers in (7) programming languages. Which of the following is NOT a programming language AWS provides resources for?
Group of answer choices
- Python
- Fortran
- Windows and .NET
- JavaScript
9) Which of the following PHP functions checks if a specified value exists in an array?
Group of answer choices
- key
- in_array
- extract
- krsort
10) Which of the following is NOT a JavaScript statement?
Group of answer choices
- Keywords
- Functions
- Comments
- Expressions
11) ___________ is a fully-managed source control service that makes it easy for companies to host secure and highly scalable private Git repositories.
Group of answer choices
- AWS CodePipeline
- AWS Command Line Interface (CLI)
- AWS CodeCommit
- AWS CodeDeploy
12) ____________ is a standard language for accessing and manipulating databases.
Group of answer choices
- HTML
- CSS
- SQL
- JavaScript
13) True or False.
AWS SDK for C++ uses a common schema to generate code that serves as service clients and high-level APIs.
Group of answer choices
- True
- False
Final Assessment
1) What is the port number for Task Tracker?
Group of answer choices
- 50050
- 50070
- 50060
- 50040
2) True or False.
Hadoop splits your data (files) into multiple chunks and assigns a single map task to each part.
Group of answer choices
- True
- False
3) True or False.
Big data is all about collecting, storing, processing, and visualizing massive amounts of data.
Group of answer choices
- True
- False
4) ____________ is a standard language for accessing and manipulating databases.
Group of answer choices
- HTML
- CSS
- SQL
- JavaScript
5) What types of visualizations are NOT supported by Amazon QuickSight? Select all that apply.
Group of answer choices
- Comparison and distribution
- Aggregation
- Correlation
- Box plots
6) What is a block in a HDFS (Hadoop Distributed File System)?
Group of answer choices
- The minimum amount of data that can be read or written
- The heart of the HDFS file system which manages the metadata
- Track of the latest checkpoint of the namespace
- A log of changes that have been made to the namespace since checkpoint
7) What is distributed cache?
Group of answer choices
- The distributed cache is a special component on the data node that will cache frequently used data for faster client response. It is used during the map step.
- The distributed cache is a component that allows developers to deploy jars for Map-Reduce processing.
- The distributed cache is a component that caches java objects.
- The distributed cache is a special component on the name node that will cache frequently used data for faster client response. It is used during the reduce step.
8) Which of the following is NOT a data aggregation best practice?
Group of answer choices
- Aggregated Data Size
- Fragment data
- Data Compression Algorithms
- Data Partitioning
9) Which AWS service is a managed cluster platform that simplifies running big data frameworks, such as Apache Hadoop and Apache Spark, on AWS to process and analyze vast amounts of data?
Group of answer choices
- Amazon Machine Learning
- Amazon Kinesis Streams
- Amazon QuickSight
- Amazon EMR (Amazon EMR)
10) What is the port number for Job Tracker?
Group of answer choices
- 50040
- 50030
- 50020
- 50050
11) By default, how many Amazon RDS DB instances can a customer have running?
Group of answer choices
- 20
- 30
- 40
- 100
12) With SQL, how can you return the number of records in the “Persons” table?
Group of answer choices
- SELECT COUNT(*) FROM Persons
- SELECT COLUMNS(*) FROM Persons
- SELECT COUNT() FROM Persons
- SELECT COLUMNS() FROM Persons
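To see how a row count query behaves, here is a minimal sketch using Python's built-in sqlite3 module with an in-memory database. The table layout and the three sample rows are made up for illustration; only the Persons table name comes from the question.

```python
import sqlite3

# In-memory SQLite database, used purely for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Persons (PersonID int, LastName varchar(255))")
conn.executemany(
    "INSERT INTO Persons VALUES (?, ?)",
    [(1, "Hansen"), (2, "Svendson"), (3, "Pettersen")],  # hypothetical rows
)

# COUNT(*) returns the number of rows in the table.
(row_count,) = conn.execute("SELECT COUNT(*) FROM Persons").fetchone()
print(row_count)
```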
13) What is Amazon RDS?
Group of answer choices
- a managed service that makes it easy to set up, operate, and scale a relational database in the cloud
- a managed cluster platform that simplifies running big data frameworks
- a web service that makes it easy to set up, manage, and scale distributed in-memory cache environments in the cloud
- a fully managed NoSQL database service that provides fast and predictable performance with seamless scalability
14) How are Reserved Instances different from On-Demand DB Instances?
Group of answer choices
- Billing Method
- Memory Resources
- CPU Resources
- Storage Capacity
15) Which SQL statement is used to create a table in a database?
Group of answer choices
- CREATE DATABASE TABLE
- CREATE DB
- CREATE DATABASE TAB
- CREATE TABLE
16) Using the scenario below order each task as Step 1, Step 2, etc. to complete the scenario activity.
Scenario: Create a NoSQL Table using Amazon DynamoDB.
Group of answer choices
- In the Table name field, type the name of your data file. -> Step 2
- Type a data attribute in the Partition Key field to spread data across partitions for scalability. -> Step 3
- In the DynamoDB console, click Create Table. -> Step 1
- Check the Add sort key box. Type a data attribute in the Sort Key field to enable easy sorting. -> Step 4
- Click Create. When the created table is ready to use, it appears in the table list with a checkbox. -> Step 5
17) Within the DynamoDB console, what is the function of the play button arrow ►?
- query a table
- run the code
- create a table
- create a database
18) What is a primary key in Amazon DynamoDB?
Group of answer choices
- a table attribute
- a data stack
- a unique identifier for each item in the table
- a data record in a table
19) What command is the following AWS Command Line Interface script executing?
Command Line script:
> pip install --upgrade awscli
Group of answer choices
- To upgrade an existing AWS CLI installation
- To download the installation script
- To test the AWS CLI Installation
- To check to see if Python is already installed
20) True or False.
JavaScript ignores multiple spaces.
Group of answer choices
- True
- False
21) In a Microsoft Windows operating system and environment, which AWS CLI installation method should you use?
Group of answer choices
- awscli library
- Pip
- MSI installer
- API Gateway
22) True or False.
Java is a case sensitive programming language.
Group of answer choices
- True
- False
23) Which of the following functions converts a string to an int in Python?
Group of answer choices
- int(x [,base])
- float(x)
- long(x [,base] )
- str(x)
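The conversion functions listed in this question can be tried directly. A short sketch in Python 3 with arbitrary sample strings; note that long() from the option list exists only in Python 2, so it is left out here.

```python
# int() parses a string as an integer; the optional second argument is the base.
decimal_value = int("42")
binary_value = int("101", 2)
hex_value = int("ff", 16)

# float() and str() convert to other types instead.
as_float = float("42")
as_text = str(42)

print(decimal_value, binary_value, hex_value, as_float, as_text)
```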
24) AWS provides a range of Java resources including blogs, community forums and sample code. Java code samples can be found at which of the following AWS resources?
Group of answer choices
- AWS SDK for Java Getting Started Guide
- AWS Java Development Center
- AWS SDK for Java on GitHub
- AWS Java Development Forum
25) What does HTML stand for?
Group of answer choices
- Hedge Text Metric Language
- Hard To Measure Language
- Highlight Text Markup Language
- Hyper Text Markup Language
26) Amazon QuickSight uses a ___________ to perform advanced calculations and render visualizations rapidly.
Group of answer choices
- Amazon EMR
- SPICE
- AWS Lambda
- Amazon Kinesis
27) Which of the following is NOT a commonly defined input format in Hadoop?
- Key Value Input Format
- Sequence File Input Format
- Text Input Format
- Hadoop Streaming Format
28) With SQL, how do you select a column named “FirstName” from a table named “Persons”?
Group of answer choices
- SELECT Persons.FirstName
- EXTRACT Persons.FirstName
- EXTRACT FirstName FROM Persons
- SELECT FirstName FROM Persons
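Selecting a single column can be checked with a small sketch, again using Python's sqlite3 module and an in-memory database. The sample rows are hypothetical; the Persons table and FirstName column come from the question.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Persons (PersonID int, LastName varchar(255), FirstName varchar(255))"
)
conn.executemany(
    "INSERT INTO Persons VALUES (?, ?, ?)",
    [(1, "Hansen", "Ola"), (2, "Svendson", "Tove")],  # hypothetical rows
)

# SELECT <column> FROM <table> returns only the named column.
first_names = [
    row[0]
    for row in conn.execute("SELECT FirstName FROM Persons ORDER BY PersonID")
]
print(first_names)
```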
29) True or False.
JavaScript statements are separated by colons.
Group of answer choices
- True
- False
30) Which of the following is a core library feature of AWS SDK for C++?
Group of answer choices
- Logging
- Credential Management
- All answer choices
- Memory Allocation
31) What is the name of the credential management interface within AWS SDK for C++?
Group of answer choices
- AWSCredentialsManager
- AWSCredentialsChain
- AWSCredentialsProvider
- AWSManager
32) Which of the following is NOT a General CMake Variable?
Group of answer choices
- TARGET_HOME
- BUILD_ONLY
- REGENERATE_CLIENTS
- ADD_CUSTOM_CLIENTS
33) Which of the following is NOT a main component of a Hadoop Application?
Group of answer choices
- Volume
- YARN
- Hadoop MapReduce
- Hadoop Distributed File System
- Hadoop Common
34) What is the port number for NameNode?
Group of answer choices
- 50070
- 50040
- 50020
- 50030
35) Which of the following is NOT a way to import data to Amazon RDS?
Group of answer choices
- copy/paste utility for RDS
- SQL Loader for Oracle
- with the mysqldump
- mysqlimport utilities for MySQL
36) True or False.
CMake options are variables that can either be ON or OFF, with a controllable default.
Group of answer choices
- True
- False
37) True or False.
The NOT NULL constraint enforces a column to not accept null values.
Group of answer choices
- True
- False
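The effect of a NOT NULL constraint can be observed with a minimal sketch, assuming SQLite through Python's sqlite3 module (other database engines report the violation with their own error types). The table and values are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Persons (PersonID int NOT NULL, LastName varchar(255) NOT NULL)"
)
conn.execute("INSERT INTO Persons VALUES (1, 'Hansen')")

# Inserting NULL into a NOT NULL column is refused by the database.
try:
    conn.execute("INSERT INTO Persons VALUES (2, NULL)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

(row_count,) = conn.execute("SELECT COUNT(*) FROM Persons").fetchone()
print(rejected, row_count)
```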
38) The following SQL statement contains how many columns?
SQL STATEMENT:
CREATE TABLE Persons
(
PersonID int,
LastName varchar(255),
FirstName varchar(255),
Address varchar(255),
City varchar(255)
);
Group of answer choices
- 4
- 5
- 3
- 2
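One way to check the column count is to run the statement and ask the database for the table's schema. A sketch assuming SQLite via Python's sqlite3 module; PRAGMA table_info is SQLite-specific, and other engines expose the schema differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE Persons
    (
    PersonID int,
    LastName varchar(255),
    FirstName varchar(255),
    Address varchar(255),
    City varchar(255)
    );
    """
)

# Each row returned by PRAGMA table_info describes one column;
# index 1 of each row holds the column name.
column_names = [row[1] for row in conn.execute("PRAGMA table_info(Persons)")]
print(len(column_names), column_names)
```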
39) What is the output of L[-2] if L = [1,2,3] in Python?
Group of answer choices
- None of these.
- 2
- 3
- 1
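Negative indices count from the end of a Python list, which is easy to verify with the list from the question:

```python
L = [1, 2, 3]

# -1 is the last element, -2 the second from the end, and so on.
last = L[-1]
second_from_end = L[-2]

print(second_from_end)
```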
40) Which of the following is correct about PHP?
Group of answer choices
- You add, delete, modify elements within your database thru PHP.
- PHP can handle forms, i.e. gather data from files, save data to a file, thru email you can send data, return data to the user.
- PHP performs system functions, i.e. from files on a system it can create, open, read, write, and close them.
- All of the above
41) What are SQL constraints?
- delete data columns
- insert data rows
- classify the data
- specify rules for the data in a table
42) What command is the following AWS Command Line Interface script executing?
Command Line script:
$ aws s3 sync myfolder s3://mybucket/myfolder --exclude *.tmp
upload: myfolder/newfile.txt to s3://mybucket/myfolder/newfile.txt
Group of answer choices
- recursive uploads and downloads of multiple files
- synchronize the contents of a local folder with a copy in an S3 bucket.
- view supported services
- view the contents of your S3 buckets
43) True or False.
HTML is a programming language.
Group of answer choices
- True
- False
Data Integration Specialist Final Project v1
1) Select the option that best completes the sentence below. Select Submit when you are done.
A Data Integration Specialist may be involved in the process of improving a company’s data integration and deployment. Data management is changing as Big Data grows and creates demands that cannot be solved by traditional databases. The three biggest areas of growth are _____.
- volume, vectors, and variety
- value, velocity, and volume
- value, vectors, and variety
- volume, variety, and velocity
2) Select the option that best completes the sentence below. Select Submit when you are done.
- Ec2 and HADOOP
- EC2 and Redshift
- EC2 and S3
- Redshift and S3
3) Match the following descriptions to their corresponding data characteristic, differentiating between a Data Warehouse and a Data Lake.
4) SQL Server tables should never contain duplicate rows or non-unique primary keys (PK). You created a new unique index on a table and it aborted because duplicate rows were found. How would you delete the duplicate rows? Write the SQL code required to resolve this task.
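One possible shape for the duplicate-removal step, shown as a runnable sketch rather than a definitive answer. In SQL Server itself, a common approach is a CTE with ROW_NUMBER() partitioned by the duplicated columns, deleting every row numbered above 1. The sketch below swaps in SQLite (through Python's sqlite3 module) and its implicit rowid so it can run anywhere; the table, rows, and index name idx_person are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table containing one exact duplicate row.
conn.execute("CREATE TABLE Persons (PersonID int, LastName varchar(255))")
conn.executemany(
    "INSERT INTO Persons VALUES (?, ?)",
    [(1, "Hansen"), (2, "Svendson"), (1, "Hansen")],
)

# Keep the first physical copy of each (PersonID, LastName) pair
# and delete every other copy.
conn.execute(
    """
    DELETE FROM Persons
    WHERE rowid NOT IN (
        SELECT MIN(rowid) FROM Persons GROUP BY PersonID, LastName
    )
    """
)

remaining = conn.execute(
    "SELECT PersonID, LastName FROM Persons ORDER BY PersonID"
).fetchall()
print(remaining)

# With the duplicates gone, the unique index can be created without error.
conn.execute("CREATE UNIQUE INDEX idx_person ON Persons (PersonID)")
```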
Final Project
Scenario: A customer wants to track access to their Amazon Simple Storage Service (S3) buckets and also use this information for their internal security and access audits.
Draft a detailed solution response – for Parts A and B – identifying and explaining your recommended solution and why you selected a specific solution.
Part A: Identify and explain which AWS service will meet the Customer’s requirements.
Part B: Identify (2) best practices for securing Amazon S3 buckets.
Project Title: Cybersecurity: Securing Amazon S3
Good luck, everyone!