Which Snowflake SQL would a Data Analyst use in a trained Cortex model named forecast_model to retrieve the components that contribute to the predictions?
forecast_model!SHOW_EVALUATION_METRICS()
forecast_model!SHOW_TRAINING_LOGS()
forecast_model!EXPLAIN_FEATURE_IMPORTANCE()
forecast_model!FORECAST()
Snowflake Cortex ML functions, such as the Forecasting and Anomaly Detection models, are designed to be "black boxes" that provide automated machine learning capabilities directly within SQL. However, for a Data Analyst to trust and validate these models, Snowflake provides specific Object Methods (invoked with the ! operator) to inspect the model's internal logic and performance.
The !EXPLAIN_FEATURE_IMPORTANCE() method is specifically designed to provide transparency into how the model reached its conclusions. When invoked on a trained forecast model, it returns a result set showing which features (such as exogenous variables or time-based components like seasonality and trend) had the most significant impact on the predicted values. This is a critical step in the Data Analysis workflow to ensure that the model is not relying on "noise" or irrelevant data points.
Evaluating the Options:
Option A (SHOW_EVALUATION_METRICS) is used to retrieve accuracy statistics like MSE (Mean Squared Error) or MAPE (Mean Absolute Percentage Error) from the training phase, but it does not explain the contribution of specific features.
Option B (SHOW_TRAINING_LOGS) is not a standard Cortex ML method; logging details are typically handled internally or through different system views.
Option D (FORECAST) is the primary method used to actually generate the future predictions once the model is trained; it outputs the forecast itself, not the underlying component importance.
Option C is the correct answer as it is the dedicated method for model interpretability, allowing analysts to see the "why" behind the forecast by quantifying the influence of each input variable. This aligns with Snowflake's focus on "Explainable AI" within the Data Cloud.
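The invocation pattern can be sketched as follows. This is a hedged illustration only: it assumes `forecast_model` was already trained with `SNOWFLAKE.ML.FORECAST`, and the follow-up `RESULT_SCAN` step shows one common way to work with the returned rows.

```sql
-- Assumes forecast_model was previously created with SNOWFLAKE.ML.FORECAST.
CALL forecast_model!EXPLAIN_FEATURE_IMPORTANCE();

-- Optionally treat the cached result as a table for filtering or persistence.
SELECT *
FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
```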
A Data Analyst is working with three tables:

Which query would return a list of all brokers, a count of the customers each broker has, and the total order amount of their customers (as shown below)?

A)

B)

C)

D)

Option A
Option B
Option C
Option D
To achieve the desired result, an analyst must understand the fundamental behavior of different JOIN types within Snowflake and how they affect the retention of records from the "left" or primary table. The goal here is to list all brokers, even those who have zero customers (like "Drew") or customers with zero orders (like "Debby").
In SQL, an INNER JOIN only returns rows when there is a match in both tables. If we were to use an INNER JOIN between BROKER and CUSTOMER, Drew would be excluded from the results because he has no associated records in the CUSTOMER table. Similarly, an INNER JOIN with the ORDERS table would exclude any broker whose customers haven't placed an order.
Evaluating the Join Logic:
Option C is the correct solution because it utilizes a chain of LEFT JOINs. A LEFT JOIN (or LEFT OUTER JOIN) ensures that every record from the left table (BROKER) is preserved in the result set. If no matching record exists in the joined table (CUSTOMER or ORDERS), Snowflake populates the columns with NULL. This is why "Drew" appears with a CUST_COUNT of 0 and "Debby" appears with a NULL for the total order amount.
Option A fails because it uses an INNER JOIN for the CUSTOMER table, which would immediately filter out "Drew."
Option B and Option D fail because they use INNER JOINs at different stages of the query, which would strip away brokers or customers that do not have matching order activity.
Additionally, the query correctly uses COUNT(DISTINCT c.customer_id) to ensure that customers are not double-counted if they have multiple orders, and GROUP BY 1 (referencing b.broker_name) to aggregate the data at the broker level. This pattern is essential for accurate Data Analysis in Snowflake when dealing with "optional" relationships in a star or snowflake schema.
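The Option C pattern described above can be sketched as below. Table and column names (BROKER, CUSTOMER, ORDERS, broker_id, customer_id, order_amount) are assumed from the question, since the exhibit is not reproduced here.

```sql
-- Sketch of the chained LEFT JOIN pattern from Option C.
SELECT
    b.broker_name,
    COUNT(DISTINCT c.customer_id) AS cust_count,    -- 0 for brokers with no customers
    SUM(o.order_amount)           AS total_amount   -- NULL when no orders exist
FROM broker b
LEFT JOIN customer c ON c.broker_id   = b.broker_id
LEFT JOIN orders   o ON o.customer_id = c.customer_id
GROUP BY 1;
```

Because COUNT ignores NULLs, a broker with no matching customers yields a count of 0 rather than NULL, while SUM over an all-NULL group returns NULL, matching the expected output for "Drew" and "Debby".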
A Data Analyst needs to temporarily hide a tile in a dashboard. The data will need to be available in the future, and additional data may be added. Which tile action should be used?
Show/Hide
Duplicate
Delete
Unplace
In Snowsight, managing dashboard layouts requires an understanding of how tiles (queries or visualizations) are stored versus how they are displayed. When an analyst wants to remove a tile from the visible dashboard grid without destroying the underlying query logic or historical configuration, the Unplace action is the correct functional choice.
When a tile is unplaced, it is removed from the dashboard's active layout but remains part of the dashboard's "library" of available content. This is a critical distinction from the Delete action (Option C), which permanently removes the tile and its associated SQL code from the dashboard object. Unplacing allows the analyst to "archive" the work temporarily. Because the tile still technically exists within the dashboard's metadata, any new data added to the underlying tables will still be processed by the query whenever the tile is eventually placed back onto the grid.
Evaluating the Options:
Option A (Show/Hide) is not a standard standalone command for dashboard tile management in Snowsight; visibility is typically managed through placement on the grid.
Option B (Duplicate) creates a second copy of the tile. While this preserves the data, it does not satisfy the requirement to "hide" the current tile; it actually adds more clutter to the dashboard.
Option C (Delete) is incorrect because the prompt specifies that the data and tile will need to be available in the future. Deleting would require the analyst to rewrite the SQL and reconfigure the visualization from scratch.
Option D is the 100% correct answer. Unplacing is the "soft-remove" feature of Snowsight. It preserves the tile in the "Unplaced Tiles" sidebar, allowing for quick restoration at a later date. This feature is essential for analysts who need to manage evolving reporting requirements where certain metrics may only be relevant seasonally or during specific business cycles.
How can a Data Analyst automatically create a table structure for loading a Parquet file?
Use the INFER_SCHEMA together with the CREATE TABLE LIKE command.
Use INFER_SCHEMA together with the CREATE TABLE USING TEMPLATE command.
Use the GENERATE_COLUMN_DESCRIPTION with the CREATE TABLE USING TEMPLATE command.
Use the GENERATE_COLUMN_DESCRIPTION with the CREATE TABLE LIKE command.
Manually defining table structures for complex semi-structured files like Parquet can be error-prone and time-consuming. Snowflake provides a specific automation workflow to handle this, involving the detection of the file's internal schema and the dynamic creation of a matching table.
The process starts with the INFER_SCHEMA function. Because Parquet files are self-describing, they contain metadata about their columns and data types. INFER_SCHEMA reads this metadata from files in a stage and returns a list of column names and types. To turn this list into an actual table, the analyst uses the CREATE TABLE ... USING TEMPLATE syntax. This command takes the output of INFER_SCHEMA as an input and automatically builds a table with the corresponding definition.
Evaluating the Options:
Option A is incorrect because CREATE TABLE LIKE is used to copy the structure of an existing table, not to build a new one from file metadata.
Option C and D are incorrect because GENERATE_COLUMN_DESCRIPTION is a helper function used to create a formatted string of column definitions, but it is not the primary command used with USING TEMPLATE for automated table creation.
Option B is the Correct answer. The combination of INFER_SCHEMA (to find the columns) and USING TEMPLATE (to build the table) is the standard Snowflake pattern for schema-on-read automation in Data Ingestion workflows.
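The standard pattern looks like the sketch below. The stage name (`@my_stage`), file format name (`my_parquet_format`), and target table name are assumptions for illustration; only the `INFER_SCHEMA` → `USING TEMPLATE` combination itself comes from the answer.

```sql
-- A Parquet file format is required so INFER_SCHEMA can read the metadata.
CREATE FILE FORMAT IF NOT EXISTS my_parquet_format TYPE = PARQUET;

-- Build the table definition directly from the staged files' schema.
CREATE TABLE my_table
  USING TEMPLATE (
    SELECT ARRAY_AGG(OBJECT_CONSTRUCT(*))
    FROM TABLE(
      INFER_SCHEMA(
        LOCATION    => '@my_stage',
        FILE_FORMAT => 'my_parquet_format'
      )
    )
  );
```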
What option would allow a Data Analyst to efficiently estimate cardinality on a data set that contains trillions of rows?
Count(Distinct *)
HLL(*)
SYSTEM$ESTIMATE
Count(Distinct *)/Count(*)
When working with "Big Data" at the scale of trillions of rows, calculating an exact count of unique values using COUNT(DISTINCT column) is extremely resource-intensive. This is because Snowflake must keep track of every unique value encountered to ensure no duplicates are counted, leading to high memory usage and long execution times (often referred to as "spilling to disk").
To solve this, Snowflake provides HyperLogLog (HLL) functions. HLL(*) (or specifically HLL_ACCUMULATE and HLL_ESTIMATE) allows an analyst to estimate the cardinality (the number of unique elements) with a very small, known margin of error (typically around 1%). This is significantly faster and uses far fewer credits than an exact count because it uses a probabilistic algorithm rather than a state-heavy tracking mechanism.
Evaluating the Options:
Option A is technically correct for small datasets but is highly inefficient for trillions of rows, directly contradicting the "efficiently" requirement of the question.
Option C is a distractor; while Snowflake has various SYSTEM$ functions, SYSTEM$ESTIMATE is not a standard function for cardinality.
Option D is a formula that doesn't target cardinality but rather a ratio (density).
Option B is the correct answer. The HLL family of functions is the industry standard within Snowflake for high-performance cardinality estimation on massive datasets.
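In practice the two approaches can be placed side by side, as in this sketch; the `events` table and `user_id` column are assumed names for illustration.

```sql
SELECT
    HLL(user_id)            AS approx_distinct_users,  -- probabilistic estimate, small error
    COUNT(DISTINCT user_id) AS exact_distinct_users    -- exact but costly at extreme scale
FROM events;
```

At trillions of rows, only the HLL column remains practical; the exact count is shown purely for comparison on smaller data.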
A Data Analyst has a Parquet file stored in an Amazon S3 staging area. Which query will copy the data from the staged Parquet file into separate columns in the target table?

Option A
Option B
Option C
Option D
In the Snowflake ecosystem, Parquet is treated as a semi-structured data format. When you stage a Parquet file, Snowflake does not automatically parse it into multiple columns like it might with a flat CSV file. Instead, the entire content of a single row or record is loaded into a single VARIANT column, which is referenced in SQL using the positional notation $1.
The fundamental mistake often made—and represented in Option A—is treating Parquet as a delimited format where $1, $2, and $3 refer to different columns. In Parquet ingestion, columns $2 and beyond will return NULL because the schema is contained within the object in $1.
To successfully "shred" or flatten this semi-structured data into a relational table with separate columns, an analyst must use path notation. This involves referencing the root object ($1), followed by a colon (:), and then the specific element key (e.g., $1:o_custkey). Furthermore, because the values extracted from a Variant are technically still Variants, they must be explicitly cast to the correct data type using the double-colon syntax (e.g., ::number, ::date) to ensure they land in the target table with the correct data types.
Evaluating the Options:
Option A is incorrect because it uses positional references ($2, $3, etc.) which are only valid for structured files like CSVs.
Option B is incorrect because it attempts to reference keys directly without the required stage variable ($1) and colon separator.
Option D is incorrect as it uses a non-standard parse() function that does not exist for this purpose in Snowflake SQL.
Option C is the 100% correct syntax. It correctly identifies that the Parquet data resides in $1, utilizes the colon to access internal keys, and applies the necessary type casting. This specific method is known as "Transformation During Ingestion" and is a core competency for any SnowPro Advanced Data Analyst.
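A hedged sketch of the Option C shape is shown below. The stage path, target table, and Parquet keys (`o_custkey`, `o_orderdate`, `o_totalprice`) are assumed from typical order data, since the exhibit is not reproduced here; the essential elements are `$1`, the colon path notation, and the `::` casts.

```sql
-- "Transformation during ingestion": shred $1 into typed relational columns.
COPY INTO orders_target (custkey, orderdate, totalprice)
FROM (
    SELECT
        $1:o_custkey::NUMBER,
        $1:o_orderdate::DATE,
        $1:o_totalprice::NUMBER(12,2)
    FROM @my_stage/orders.parquet
)
FILE_FORMAT = (TYPE = PARQUET);
```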
Consider the following chart.

What can be said about the correlation for sales over time between the two categories?
There is a positive correlation.
There is a negative correlation.
There is no correlation.
There is a non-linear correlation.
In Data Analysis, correlation refers to a statistical relationship between two variables. When analyzing a time-series chart like the one provided, a Data Analyst looks for patterns in how the two categories—"Enterprise" (blue line) and "Pro Edition" (yellow line)—move in relation to one another over the X-axis (Year).
A Positive Correlation would be indicated if both lines generally moved in the same direction at the same time (e.g., when Enterprise sales increase, Pro Edition sales also increase). A Negative Correlation (or inverse correlation) would be shown if the lines moved in opposite directions consistently (e.g., when one peaks, the other troughs).
Looking closely at the provided exhibit, the fluctuations for both editions are highly erratic and appear independent of each other. For instance, around the year 2008, the Pro Edition (yellow) shows a significant peak while the Enterprise edition (blue) experiences a sharp decline. Conversely, in other sections of the chart, they both dip or rise simultaneously by chance, but there is no sustained, predictable pattern of movement. The peaks and valleys do not align in a way that suggests one variable's movement is tied to the other.
Statistically, this lack of a discernible relationship indicates a Correlation Coefficient near zero. In the context of the Snowflake Snowpro Advanced: Data Analyst exam, identifying "No Correlation" is a key skill for interpreting Snowsight visualizations. It tells the analyst that the factors driving sales for the Enterprise tier are likely distinct from those driving the Pro Edition, and they should be analyzed as independent segments rather than interdependent variables. Therefore, based on the visual evidence of random, non-synchronous movement across the timeline, the only supported conclusion is that there is no correlation.
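The visual judgment can be backed up numerically with Snowflake's CORR() aggregate, as in this sketch; the `sales_by_year` table and its column names are assumptions for illustration. A coefficient near zero supports the "no correlation" conclusion.

```sql
-- Pearson correlation between the two series; values near 0 mean no linear relationship.
SELECT CORR(enterprise_sales, pro_edition_sales) AS corr_coefficient
FROM sales_by_year;
```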
Why would a Data Analyst use a dimensional model rather than a single flat table to meet BI requirements for a virtual warehouse? (Select TWO).
Dimensional modelling will improve query performance over a single table.
Dimensional modelling will save on storage space since it is denormalized.
Combining facts and dimensions in a single flat table limits the scalability and flexibility.
Dimensions and facts allow power users to run ad-hoc analyses.
Snowflake generally performs better with dimensional modelling.
In the field of data warehousing and business intelligence (BI), choosing the right data model is crucial for long-term maintainability and user accessibility. While a single flat table might seem simple initially, dimensional modeling (typically using Star or Snowflake schemas) provides distinct advantages for enterprise analytics.
1. Scalability and Flexibility (Option C)
Combining all attributes into a single flat table creates a highly rigid structure. Every time a new attribute is added to a dimension (e.g., adding a "Promotion Category" to a product), the entire flat table must be rewritten or altered, which is inefficient for large datasets. Furthermore, flat tables often contain redundant data, leading to "update anomalies" where a change in a dimension attribute must be propagated across millions of rows. A dimensional model separates changing business processes (Facts) from the context of those processes (Dimensions), allowing the schema to scale and evolve independently.
2. Ad-hoc Analysis for Power Users (Option D)
Dimensional models are specifically designed to be intuitive for business users and BI tools. By organizing data into Facts (measurable metrics) and Dimensions (descriptive attributes), power users can easily "slice and dice" data across different hierarchies. For example, a user can quickly run an ad-hoc query to compare "Total Sales" (Fact) by "Store Region" (Dimension) and "Calendar Month" (Dimension). This structure provides a predictable and standardized "language" for the data, making it easier for users to build their own reports without needing a Data Analyst to create a custom flat table for every specific request.
Evaluating the Distractors:
Options A and E: These are common misconceptions. Modern cloud data warehouses like Snowflake are often highly optimized for wide "flat" tables due to columnar storage and sophisticated pruning. In many cases, a flat table may actually outperform a multi-table join (dimensional model) because it avoids the computational overhead of the join itself.
Option B: This is factually incorrect. Flat tables are denormalized (repeating data), which generally takes more storage space. Dimensional modeling is a form of normalization that saves space by storing descriptive strings once in a dimension table rather than repeating them for every transaction in a fact table.
A Data Analyst created a model called modelX using SNOWFLAKE.ML.FORECAST. The Analyst needs to predict the next few values and save the result directly into tableX. What step does the Analyst need to take after calling the modelX!FORECAST function?
Load the function call results directly INTO tableX.
Pass the new table as a function argument.
Create the table by querying the RESULT_SCAN.
List the cache content, then use the data saved in the RESULT_SCAN for tableX.
Snowflake Cortex ML functions, such as FORECAST, return a tabular result set when called using the instance method syntax (e.g., CALL modelX!FORECAST(...)). While this output is visible in the Snowsight results pane, the CALL statement itself cannot be used directly as a subquery within a standard INSERT INTO or CREATE TABLE AS SELECT (CTAS) statement.
To persist the results of a model's prediction into a permanent table (tableX), the Data Analyst must utilize the RESULT_SCAN table function. Snowflake stores the results of every query and function call in a temporary cache for 24 hours. The RESULT_SCAN function allows you to treat that cache as a queryable table.
The standard workflow is:
Execute the forecast: CALL modelX!FORECAST(FORECASTING_PERIODS => 12);
Immediately after, use the LAST_QUERY_ID() function to identify the query that generated the forecast results.
Create the table by querying that result set: CREATE TABLE tableX AS SELECT * FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
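The steps above can be run back-to-back as a single sketch; `modelX`, `tableX`, and the forecast horizon of 12 come from the question and the example call.

```sql
-- Step 1: generate the forecast (results land in the 24-hour query result cache).
CALL modelX!FORECAST(FORECASTING_PERIODS => 12);

-- Step 2: persist the cached result set into a permanent table.
CREATE TABLE tableX AS
SELECT * FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
```

Because LAST_QUERY_ID() refers to the most recent statement in the session, the CTAS must run immediately after the CALL, with no other statements in between.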
Evaluating the Options:
Option A is incorrect because the CALL syntax does not support a direct INTO clause for table creation.
Option B is incorrect because passing a table as an argument is part of the training or input phase, not the output persistence phase.
Option D is overly complex and contains non-standard terminology ("List the cache content").
Option C is the 100% correct answer. It reflects the required "post-processing" step in the Snowflake Data Cloud to bridge the gap between procedural model calls and relational table storage.
Table TB_A with column COL_B contains an ARRAY. Which statement will select the last element of the ARRAY?
SELECT GET(COL_B, ARRAY_SIZE(COL_B)-1) FROM TB_A;
SELECT COL_B[ARRAY_SIZE(COL_B)] FROM TB_A;
SELECT COL_B[-1] FROM TB_A;
SELECT LAST_VALUE(COL_B) FROM TB_A;
Working with semi-structured data types like Arrays is a core competency for a Snowflake Data Analyst. In Snowflake, arrays are zero-indexed, meaning the first element is at position 0. Consequently, the index of the last element is always the total number of elements minus one ($Size - 1$).
To retrieve an element from a specific index, Snowflake provides the GET() function. This function takes the array column and the calculated index as arguments. When combined with ARRAY_SIZE(), which returns the total count of elements in the array, the formula ARRAY_SIZE(COL_B)-1 accurately targets the final index regardless of the array's length.
Evaluating the Options:
Option B is incorrect because using ARRAY_SIZE as the index directly (without subtracting 1) results in an "out-of-bounds" error or returns NULL, because the index equals the length (e.g., in an array of 3 items, the max index is 2).
Option C is incorrect. While some programming languages (like Python) allow negative indexing to start from the end, Snowflake SQL does not support this shorthand for arrays; it would simply return NULL.
Option D is incorrect because LAST_VALUE is an Analytic/Window function used to find the last row in a sorted result set, not the last element within a single array cell.
Option A is the 100% correct approach. It uses the standard, robust method for dynamic index calculation. This ensures that even if different rows have arrays of different lengths, the query will always successfully "pluck" the final item from each. This skill is vital for Data Transformation tasks, such as extracting the most recent status from a history array.
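A self-contained sketch of the pattern, using an inline three-element array rather than the TB_A table from the question:

```sql
-- GET with ARRAY_SIZE - 1 targets the final index; here it returns 30.
SELECT GET(arr, ARRAY_SIZE(arr) - 1) AS last_element
FROM (SELECT ARRAY_CONSTRUCT(10, 20, 30) AS arr);
```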
A Data Analyst runs this query:

The Analyst then runs this query:

What will be the output?
A)

B)

C)

D)

Option A
Option B
Option C
Option D
Understanding how Snowflake aggregate functions like MIN() and MAX() handle numerical data and NULL values is fundamental for accurate Data Analysis. In this scenario, we have a table with five records distributed across two departments.
The MIN() function returns the smallest non-null value in the specified column across all rows in the group (or the entire table, if no GROUP BY is present). Looking at the salary column in the employees table, the values are: 10000, 9000, 8000, 15000, and NULL. The NULL value is ignored by the calculation. Among the remaining numerical values, 8000 is the smallest. Therefore, MIN_VAL will be 8000.
The MAX() function operates similarly, returning the largest non-null value from the set. Comparing the same list of numerical values—10000, 9000, 8000, and 15000—the largest value is clearly 15000. Consequently, MAX_VAL will be 15000.
Evaluating the Options based on the exhibit:
Option A incorrectly identifies the minimum and maximum based on the employee_id column (where 2000 is max and 900 is min), rather than the requested salary column.
Option B is incorrect as it seems to mix values from different columns or specific rows.
Option C is incorrect because it mistakenly suggests that MIN() returns NULL if a NULL value is present in the column. In Snowflake, standard aggregate functions (except for specialized ones like ARRAY_AGG or certain window functions with specific clauses) skip NULL values entirely.
Option D is the 100% correct output. It displays MIN_VAL as 8000 and MAX_VAL as 15000, which accurately reflects the mathematical minimum and maximum of the non-null entries in the salary column.
This behavior is consistent across all relational databases adhering to ANSI SQL standards, which Snowflake follows for these core aggregate operations.
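The behavior can be reproduced with a self-contained sketch that mirrors the salary values described above, without needing the original employees table:

```sql
-- MIN and MAX skip the NULL row: the result is 8000 and 15000.
SELECT MIN(salary) AS min_val, MAX(salary) AS max_val
FROM (SELECT * FROM VALUES (10000), (9000), (8000), (15000), (NULL) AS t(salary));
```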
A Data Analyst creates a dashboard showing the total credit consumption for each virtual warehouse as follows:

Why is the query failing?
The query must be executed by a user with the ACCOUNTADMIN role.
INFORMATION_SCHEMA should be used instead of ACCOUNT_USAGE.
DB1 must be authorized to have SELECT access to ACCOUNT_USAGE.
The current database context must be changed to SNOWFLAKE.
This error occurs due to a misunderstanding of how Snowflake resolves object names and the location of the Shared Snowflake Database. In Snowflake, the ACCOUNT_USAGE schema is a collection of views that provide comprehensive historical data about account activities, including credit consumption, storage, and query history. These views are stored within the system-defined database named SNOWFLAKE.
When the Data Analyst executes the query shown in the image, they are operating within the database context of DB1 and the schema context of PUBLIC. Because the query references ACCOUNT_USAGE.WAREHOUSE_METERING_HISTORY without a fully qualified name, the Snowflake compiler attempts to resolve the object starting from the current database context. Consequently, it looks for a schema named ACCOUNT_USAGE inside DB1. As the error message correctly indicates, Schema 'DB1.ACCOUNT_USAGE' does not exist.
To fix this, the Analyst has two options:
Fully Qualify the Object Name: Modify the query to reference the full path: FROM SNOWFLAKE.ACCOUNT_USAGE.WAREHOUSE_METERING_HISTORY. This is the most common best practice as it allows the query to run regardless of the user's current session context.
Change the Context: Use the command USE DATABASE SNOWFLAKE; before running the query. This changes the session's database context so that ACCOUNT_USAGE can be found.
It is a common misconception (represented in Option A) that only an ACCOUNTADMIN can access these views. While access is restricted by default, the ACCOUNTADMIN can grant IMPORTED PRIVILEGES on the SNOWFLAKE database to other roles, such as a Data Analyst role. However, the specific error shown is a namespace resolution error, not a permission-denied error (which would typically say "Insufficient privileges"). Therefore, changing the context or fully qualifying the name is the direct solution to this specific failure.
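A corrected version of the dashboard query, with the fully qualified name, might look like the sketch below; since the original query is not reproduced here, the SUM/GROUP BY shape is an assumption based on the stated goal of total credits per warehouse.

```sql
-- Fully qualified reference avoids any dependence on the session's database context.
SELECT warehouse_name, SUM(credits_used) AS total_credits
FROM SNOWFLAKE.ACCOUNT_USAGE.WAREHOUSE_METERING_HISTORY
GROUP BY 1;
```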
A Data Analyst runs a query in a Snowflake worksheet, and selects a numeric column from the result grid. What automatically-generated contextual statistic can be visualized?
A histogram, displayed for all numeric, date, and time columns
A frequency distribution, displayed for all numeric columns
MIN/MAX values for the column
A key distribution
One of the standout features of the Snowsight interface is its ability to perform automatic Data Profiling. When a Data Analyst executes a query, Snowflake doesn't just return a raw grid of data; it analyzes the result set to provide immediate visual insights.
When you click on a column header in the results pane, a summary statistics panel appears. For numeric, date, and time columns, Snowflake automatically generates a histogram (Option A). This histogram provides a visual representation of the data distribution, allowing the analyst to quickly identify patterns, concentrations of values, or significant outliers without writing additional SQL code.
Evaluating the Options:
Option B: While a histogram is a type of frequency distribution, Option A is more accurate because Snowsight also provides these visualizations for date and time types, not just integers/floats.
Option C: While MIN and MAX values are displayed in the summary panel, they are text-based statistics, not the "visualized" contextual statistic (the histogram) emphasized in the question.
Option D: "Key distribution" is not a standard visualization term used in the Snowsight profiling tool.
Option A is the correct answer. It reflects both the breadth of the profiling tool (covering numeric, date, and time columns) and the specific visual element (the histogram) that makes exploratory data analysis significantly faster for a Data Analyst.
A Data Analyst creates and populates the following table:
create or replace table aggr(v int) as select * from values (1), (2), (3), (4);
The Analyst then executes this query:
select percentile_disc(0.60) within group (order by v desc) from aggr;
What will be the result?
1
2
3
4
The PERCENTILE_DISC (discrete percentile) function is an inverse distribution function that assumes a discrete distribution model. It takes a percentile value and a sort specification and returns the value from the set that corresponds to that percentile. Unlike PERCENTILE_CONT, which interpolates between values to find a continuous result, PERCENTILE_DISC always returns an actual value from the input set.
In this scenario, we have a set of four values: $\{1, 2, 3, 4\}$. The query specifies a descending order (order by v desc), so the ordered set for the calculation is $\{4, 3, 2, 1\}$.
To find the discrete percentile, Snowflake calculates the cumulative distribution. For a set of $N$ elements, each element represents a percentile rank of $1/N$. With 4 elements, each covers 25% ($0.25$) of the distribution:
Value 4: Cumulative Percentile $0.25$
Value 3: Cumulative Percentile $0.50$
Value 2: Cumulative Percentile $0.75$
Value 1: Cumulative Percentile $1.00$
The PERCENTILE_DISC(0.60) function looks for the first value whose cumulative distribution is greater than or equal to the specified percentile ($0.60$).
$0.25$ (Value 4) is not $\ge 0.60$.
$0.50$ (Value 3) is not $\ge 0.60$.
$0.75$ (Value 2) is the first value where the cumulative distribution is $\ge 0.60$.
Therefore, the result is 2. If the order had been ascending (ASC), the cumulative distribution would have been $\{1: 0.25, 2: 0.50, 3: 0.75, 4: 1.00\}$, and the result for $0.60$ would have been 3. Understanding the impact of the ORDER BY clause within the WITHIN GROUP syntax is a critical skill for the Data Analysis domain of the SnowPro Advanced: Data Analyst exam.
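The cumulative-distribution walk described above can be expressed as a short sketch. This is an illustrative Python model of PERCENTILE_DISC's semantics, not Snowflake internals: sort the values, then return the first one whose cumulative distribution reaches the requested percentile.

```python
# Illustrative model of PERCENTILE_DISC: first value (in sort order) whose
# cumulative distribution i/N is >= the requested percentile p.
def percentile_disc(values, p, descending=False):
    ordered = sorted(values, reverse=descending)
    n = len(ordered)
    for i, v in enumerate(ordered, start=1):
        if i / n >= p:      # cumulative distribution of the i-th value
            return v

rows = [1, 2, 3, 4]
print(percentile_disc(rows, 0.60, descending=True))   # 2 (ORDER BY v DESC)
print(percentile_disc(rows, 0.60, descending=False))  # 3 (ORDER BY v ASC)
```

Running both orderings side by side makes the exam's key point concrete: the same percentile argument yields different results depending solely on the ORDER BY direction.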
A Data Analyst executes a query in a Snowflake worksheet that returns the total number of daily sales, and the total amount for each sale. How can the Analyst check the distribution of the total amount, without running the query again?
Click on the column header in the results and review the histogram.
Go to Chart and select a histogram that includes the two variables.
Go to Chart and select a bar chart that contains the two variables.
Call the WIDTH_BUCKET function.
One of the most powerful features of the Snowsight interface for a Data Analyst is the automatic data profiling provided in the results pane. Snowflake automatically calculates statistics and visual distributions for the result set of any query executed in a worksheet, provided the result set is not excessively large.
When the Analyst views the query results, they can simply click on the column header for the "total amount" column. This action opens a summary pane that displays key descriptive statistics such as the mean, sum, and a histogram showing the frequency distribution of the values in that specific column. This allows for immediate visual analysis of data skew, outliers, or common ranges without requiring the analyst to write additional SQL or move the data to an external visualization tool.
Evaluating the Options:
Option A is the correct answer. This is the fastest, built-in way to perform exploratory data analysis (EDA) on a result set within the UI.
Options B and C are incorrect because while Snowsight does have a "Chart" tab, creating a chart requires manual configuration and is a separate step from the automatic profiling features found in the column headers.
Option D is incorrect because calling the WIDTH_BUCKET function would require the Analyst to run the query again with modified SQL logic, which explicitly contradicts the requirements of the question.
This feature significantly enhances the Data Analysis workflow by providing "at-a-glance" insights into data quality and distribution directly within the development environment.
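For contrast with the built-in histogram, the WIDTH_BUCKET approach rejected in Option D would require new SQL. The sketch below is a hedged Python model of standard equal-width bucketing (bucket 0 for underflow, num_buckets + 1 for overflow), illustrating the kind of manual binning that function performs; the amount range 0-100 and bucket count are hypothetical.

```python
# Illustrative model of equal-width bucketing as performed by a function like
# WIDTH_BUCKET(expr, min_value, max_value, num_buckets). Not Snowflake code;
# the range [0, 100) and 4 buckets are made-up example parameters.
def width_bucket(value, min_value, max_value, num_buckets):
    if value < min_value:
        return 0                      # underflow bucket
    if value >= max_value:
        return num_buckets + 1        # overflow bucket
    width = (max_value - min_value) / num_buckets
    return int((value - min_value) / width) + 1

# Bin some sale amounts into 4 equal-width buckets over [0, 100)
print([width_bucket(v, 0, 100, 4) for v in [5, 30, 55, 80, 100]])
```

Even this small sketch shows why Option D fails the question's constraint: producing these bucket numbers means writing and executing a new query, whereas the column-header histogram is computed from the result set already in hand.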
What will the following query return?
SELECT * FROM testtable SAMPLE BLOCK (0.012) REPEATABLE (99992);
A sample of a table in which each block of rows has a 1.2% probability of being included in the sample where repeated elements are allowed.
A sample of a table in which each block of rows has a 0.012% probability of being included in the sample, with the seed set to 99992.
A sample of a table in which each block of rows has a 1.2% probability of being included in the sample, with the seed set to 99992.
A sample containing 99992 records of a table in which each block of rows has a 0.012% probability of being included in the sample.
The SAMPLE clause (or TABLESAMPLE) is used in Snowflake to return a subset of rows from a table. When performing analysis on massive datasets, sampling allows for faster query execution and reduced credit consumption while still providing a statistically representative view of the data.
There are two primary methods of sampling in Snowflake: BERNOULLI (row-based) and BLOCK (partition-based). The query in this question uses BLOCK sampling, which selects a specific percentage of micro-partitions (blocks) rather than individual rows. This method is significantly faster for very large tables because it avoids the overhead of scanning every single row within a block; it either includes the entire block or skips it entirely.
Evaluating the Syntax:
Probability: The value inside the parentheses (0.012) represents the probability percentage for inclusion. Unlike some systems that might use decimals (where 1.0 = 100%), Snowflake treats this number as a direct percentage. Therefore, 0.012 is exactly 0.012%, not 1.2%.
Repeatable/Seed: The REPEATABLE clause (or SEED) followed by a number (99992) ensures that the sampling is deterministic. If the underlying data does not change, running this same query multiple times with the same seed will return the exact same "random" subset of blocks.
Evaluating the Options:
Options A and C are incorrect because they misinterpret the probability 0.012 as 1.2%.
Option D is incorrect because it mistakenly identifies the seed number 99992 as a target row count.
Option B is the correct answer, as it accurately identifies the sampling method (BLOCK), the correct percentage probability (0.012%), and the role of the seed (99992).
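The two ideas at play — the argument being a direct percentage, and the seed making the sample deterministic — can be modeled in a few lines. This is an illustrative Python sketch, not Snowflake's sampling implementation; the block count is a made-up example.

```python
# Illustrative model of seeded BLOCK sampling (not Snowflake internals):
# each block is kept with probability p, and a fixed seed makes the
# "random" selection repeatable.
import random

def sample_blocks(num_blocks, probability_pct, seed):
    rng = random.Random(seed)          # REPEATABLE (seed)
    p = probability_pct / 100.0        # 0.012 means 0.012%, not 1.2%
    return [b for b in range(num_blocks) if rng.random() < p]

# Hypothetical table with 100,000 blocks, sampled as in the question
run1 = sample_blocks(100_000, 0.012, 99992)
run2 = sample_blocks(100_000, 0.012, 99992)
print(run1 == run2)   # True: same seed, same data, same sample
```

Note how small the expected sample is: at 0.012%, roughly 12 of the 100,000 blocks are kept, which is why misreading the argument as 1.2% (a sample about 100 times larger) is the trap in Options A and C.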
Copyright © 2014-2026 Certensure. All Rights Reserved