Latest Databricks-Certified-Data-Engineer-Associate Free Dumps - Databricks Certified Data Engineer Associate
Which of the following can be used to simplify and unify siloed data architectures that are specialized for specific use cases?
Answer: D
A data engineer has joined an existing project and they see the following query in the project repository:
CREATE STREAMING LIVE TABLE loyal_customers AS
SELECT customer_id
FROM STREAM(LIVE.customers)
WHERE loyalty_level = 'high';
Which of the following describes why the STREAM function is included in the query?
Answer: C
A single Job runs two notebooks as two separate tasks. A data engineer has noticed that one of the notebooks is running slowly in the Job's current run. The data engineer asks a tech lead for help in identifying why this might be the case.
Which of the following approaches can the tech lead use to identify why the notebook is running slowly as part of the Job?
Answer: E
A data engineer is working with two tables. Each of these tables is displayed below in its entirety.
The data engineer runs the following query to join these tables together:
Which of the following will be returned by the above query?

Answer: E
Which of the following is hosted completely in the control plane of the classic Databricks architecture?
Answer: C
What is stored in a Databricks customer's cloud account?
Answer: B
Which file format is used for storing Delta Lake tables?
Answer: B
A data engineer has configured a Structured Streaming job to read from a table, manipulate the data, and then perform a streaming write into a new table.
The code block used by the data engineer is below:

If the data engineer only wants the query to execute a micro-batch to process data every 5 seconds, which of the following lines of code should the data engineer use to fill in the blank?
Answer: C
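For reference, PySpark Structured Streaming sets a fixed micro-batch interval through the `trigger` method of the stream writer. The sketch below is a minimal illustration and is not the code block from the question, which is not reproduced on this page. The function name `start_five_second_stream` and the `checkpoint_path` and `target_table` parameters are hypothetical, and `streaming_df` is assumed to be a streaming DataFrame.

```python
def start_five_second_stream(streaming_df, checkpoint_path, target_table):
    """Start a streaming write that runs one micro-batch every 5 seconds.

    `streaming_df` is assumed to be a PySpark streaming DataFrame;
    the other arguments are illustrative placeholders.
    """
    return (
        streaming_df.writeStream
        .option("checkpointLocation", checkpoint_path)  # where stream progress is tracked
        .outputMode("append")
        .trigger(processingTime="5 seconds")  # fixed-interval micro-batches
        .toTable(target_table)  # starts the query and returns a StreamingQuery
    )
```

Other trigger settings, such as `trigger(availableNow=True)`, process the data that is currently available and then stop instead of running at an interval.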
A data engineer is designing a data pipeline. The source system generates files in a shared directory that is also used by other processes. As a result, the files should be kept as is and will accumulate in the directory. The data engineer needs to identify which files are new since the previous run in the pipeline, and set up the pipeline to only ingest those new files with each run.
Which of the following tools can the data engineer use to solve this problem?
Answer: A
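Databricks Auto Loader is designed for this kind of incremental file ingestion. The underlying idea, remembering which files earlier runs have already seen and picking up only the rest, can be sketched in plain Python. This is a conceptual illustration only, not Auto Loader's API or implementation; the function `find_new_files` and the JSON state file are hypothetical.

```python
import json
from pathlib import Path


def find_new_files(source_dir, state_file):
    """Return files in source_dir not seen in earlier runs, and record them.

    Source files are left untouched; only the state file is written.
    """
    state_path = Path(state_file)
    seen = set(json.loads(state_path.read_text())) if state_path.exists() else set()

    current = {p.name for p in Path(source_dir).iterdir() if p.is_file()}
    new_files = sorted(current - seen)

    # Persist the updated set so the next run skips these files.
    state_path.write_text(json.dumps(sorted(seen | current)))
    return new_files
```

Keeping the state outside the shared directory means the other processes that use that directory are unaffected.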
A dataset has been defined using Delta Live Tables and includes an expectations clause:
CONSTRAINT valid_timestamp EXPECT (timestamp > '2020-01-01') ON VIOLATION FAIL UPDATE
What is the expected behavior when a batch of data containing data that violates these constraints is processed?
Answer: B
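In Delta Live Tables, an expectation with no `ON VIOLATION` clause keeps violating records and only reports them, `DROP ROW` discards them, and `FAIL UPDATE` stops the update. The sketch below imitates these three policies in plain Python to show how they differ. It is an illustration only: `apply_expectation`, the policy names, and the sample rows are hypothetical and are not part of the DLT API.

```python
class ExpectationFailed(Exception):
    """Raised when a record violates a fail-type expectation."""


def apply_expectation(rows, predicate, on_violation="warn"):
    """Apply an expectation to a batch, mimicking DLT violation policies.

    "warn": keep all rows (violations would only be reported as metrics).
    "drop": keep only the rows that satisfy the predicate.
    "fail": raise, so nothing from the batch is written.
    """
    violations = [r for r in rows if not predicate(r)]
    if on_violation == "fail" and violations:
        raise ExpectationFailed(f"{len(violations)} record(s) violated the constraint")
    if on_violation == "drop":
        return [r for r in rows if predicate(r)]
    return list(rows)


# Sample batch: the second row violates timestamp > '2020-01-01'.
batch = [{"timestamp": "2021-06-01"}, {"timestamp": "2019-12-31"}]
valid_timestamp = lambda r: r["timestamp"] > "2020-01-01"  # ISO dates compare correctly as strings
```

With the `"fail"` policy, a single violating row is enough to raise, so none of the rows in the batch are returned.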