
Snowflake DSA-C02 SnowPro Advanced: Data Scientist Certification Exam Practice Test

Demo: 17 questions
Total 65 questions

SnowPro Advanced: Data Scientist Certification Exam Questions and Answers

Question 1

Which of the following are correct rules when using a data science model created via an external function in Snowflake?

Options:

A.

External functions return a value. The returned value can be a compound value, such as a VARIANT that contains JSON.

B.

External functions can be overloaded.

C.

An external function can appear in any clause of a SQL statement in which other types of UDF can appear.

D.

External functions can accept Model parameters.

Question 2

Which type of Python UDFs let you define Python functions that receive batches of input rows as Pandas DataFrames and return batches of results as Pandas arrays or Series?

Options:

A.

MPP Python UDFs

B.

Scalar Python UDFs

C.

Vectorized Python UDFs

D.

Hybrid Python UDFs
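The batch-oriented style described in the question can be sketched locally. In Snowflake itself the handler is marked with the `_snowflake.vectorized` decorator; the function below keeps the same shape (DataFrame batch in, Series out) but omits the decorator so it runs anywhere. The integer column labels are an assumption of this sketch.

```python
import pandas as pd

# In Snowflake, a vectorized Python UDF handler is decorated with
#   import _snowflake
#   @_snowflake.vectorized(input=pd.DataFrame)
# It then receives a batch of input rows as a Pandas DataFrame and
# must return a Series (or array) of the same length.

def add_columns(df: pd.DataFrame) -> pd.Series:
    # Operate on the whole batch at once instead of row by row.
    return df[0] + df[1]

batch = pd.DataFrame({0: [1, 2, 3], 1: [10, 20, 30]})
result = add_columns(batch)
```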

Question 3

What is the formula for measuring skewness in a dataset?

Options:

A.

MEAN - MEDIAN

B.

MODE - MEDIAN

C.

(3(MEAN - MEDIAN))/ STANDARD DEVIATION

D.

(MEAN - MODE)/ STANDARD DEVIATION
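The formula in option C is Pearson's second (median) skewness coefficient. A quick sketch using only the Python standard library:

```python
import statistics

def pearson_skewness(data):
    """Pearson's second skewness coefficient: 3 * (mean - median) / stdev."""
    mean = statistics.mean(data)
    median = statistics.median(data)
    stdev = statistics.stdev(data)
    return 3 * (mean - median) / stdev

# A right-skewed sample: the outlier pulls the mean above the median,
# so the coefficient comes out positive.
sample = [1, 2, 2, 3, 3, 3, 4, 10]
skew = pearson_skewness(sample)
```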

Question 4

Which of the following is not a valid option to share data in Snowflake?

Options:

A.

a Listing, in which you offer a share and additional metadata as a data product to one or more accounts.

B.

a Direct Marketplace, in which you directly share specific database objects (a share) to another account in your region using Snowflake Marketplace.

C.

a Direct Share, in which you directly share specific database objects (a share) to another account in your region.

D.

a Data Exchange, in which you set up and manage a group of accounts and offer a share to that group.

Question 5

The most widely used metrics and tools to assess a classification model are:

Options:

A.

Confusion matrix

B.

Cost-sensitive accuracy

C.

Area under the ROC curve

D.

All of the above
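Of the tools listed, the confusion matrix is the easiest to reproduce by hand; a minimal pure-Python sketch for binary labels:

```python
def confusion_matrix(y_true, y_pred):
    """Return (TP, FP, FN, TN) counts for binary 0/1 labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fp, fn, tn

y_true = [1, 0, 1, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1]
tp, fp, fn, tn = confusion_matrix(y_true, y_pred)
accuracy = (tp + tn) / len(y_true)
```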

Question 6

Which learning methodology applies the conditional probability of all the variables with respect to the dependent variable?

Options:

A.

Reinforcement learning

B.

Unsupervised learning

C.

Artificial learning

D.

Supervised learning

Question 7

Which of the following is a useful tool for gaining insights into the relationship between features and predictions?

Options:

A.

numpy plots

B.

sklearn plots

C.

Partial dependence plots(PDP)

D.

FULL dependence plots (FDP)
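The core of a partial dependence plot needs no plotting library: fix the feature of interest at each grid value for every row, then average the model's predictions. A minimal sketch with a made-up linear model:

```python
def partial_dependence(model, X, feature_idx, grid):
    """For each grid value, fix one feature across all rows and
    average the model's predictions (the essence of a PDP curve)."""
    curve = []
    for value in grid:
        preds = []
        for row in X:
            modified = list(row)
            modified[feature_idx] = value
            preds.append(model(modified))
        curve.append(sum(preds) / len(preds))
    return curve

# Toy model: prediction depends linearly on feature 0 only.
model = lambda row: 2.0 * row[0] + 0.0 * row[1]
X = [[1, 5], [2, 7], [3, 9]]
curve = partial_dependence(model, X, feature_idx=0, grid=[0, 1, 2])
```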

Question 8

Which one of the following is not a key component when designing external functions within Snowflake?

Options:

A.

Remote Service

B.

API Integration

C.

UDF Service

D.

Proxy Service

Question 9

Which of the following Snowflake parameters can be used to automatically suspend tasks running data science pipelines after a specified number of failed runs?

Options:

A.

SUSPEND_TASK

B.

SUSPEND_TASK_AUTO_NUM_FAILURES

C.

SUSPEND_TASK_AFTER_NUM_FAILURES

D.

There is no such parameter available.

Question 10

Which of the following cross validation versions may not be suitable for very large datasets with hundreds of thousands of samples?

Options:

A.

k-fold cross-validation

B.

Leave-one-out cross-validation

C.

Holdout method

D.

All of the above
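Leave-one-out cross-validation fits one model per sample, which is what makes it impractical at the scale the question describes. A toy sketch with a trivial mean predictor (the helper name is made up for illustration):

```python
def loocv_mean_model(data):
    """Leave-one-out CV of a trivial mean predictor.
    Returns (mean squared error, number of model fits) --
    note the fit count equals len(data)."""
    errors = []
    fits = 0
    for i in range(len(data)):
        train = data[:i] + data[i + 1:]
        fits += 1                       # one "training" per held-out sample
        prediction = sum(train) / len(train)
        errors.append((data[i] - prediction) ** 2)
    return sum(errors) / len(errors), fits

mse, fits = loocv_mean_model([1.0, 2.0, 3.0, 4.0])
# With hundreds of thousands of samples, that fit count -- unlike
# k-fold's fixed k -- grows with the dataset itself.
```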

Question 11

What is the risk with tuning hyper-parameters using a test dataset?

Options:

A.

Model will overfit the test set

B.

Model will underfit the test set

C.

Model will overfit the training set

D.

Model will perform balanced

Question 12

Select the correct statement regarding normalization.

Options:

A.

Normalization technique uses minimum and maximum values for scaling.

B.

Normalization technique uses mean and standard deviation for scaling.

C.

Scikit-Learn provides a transformer RecommendedScaler for Normalization.

D.

Normalization gets affected by outliers.
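To contrast the two scaling techniques the options describe: normalization (min-max) rescales with the minimum and maximum, while standardization uses the mean and standard deviation (Scikit-Learn's actual transformers are `MinMaxScaler` and `StandardScaler`). A standard-library sketch:

```python
import statistics

def min_max_normalize(values):
    """Normalization: rescale to [0, 1] using min and max.
    A single outlier stretches the range, squeezing everything else."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

def standardize(values):
    """Standardization: center on the mean, scale by the standard deviation."""
    mean = statistics.mean(values)
    stdev = statistics.pstdev(values)
    return [(v - mean) / stdev for v in values]

scaled = min_max_normalize([10, 20, 30, 40])
```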

Question 13

Mark the correct steps for saving the contents of a DataFrame to a Snowflake table as part of moving data from Spark to Snowflake.

Options:

A.

Step 1.Use the PUT() method of the DataFrame to construct a DataFrameWriter.

Step 2.Specify SNOWFLAKE_SOURCE_NAME using the NAME() method.

Step 3.Use the dbtable option to specify the table to which data is written.

Step 4.Specify the connector options using either the option() or options() method.

Step 5.Use the save() method to specify the save mode for the content.

B.

Step 1.Use the PUT() method of the DataFrame to construct a DataFrameWriter.

Step 2.Specify SNOWFLAKE_SOURCE_NAME using the format() method.

Step 3.Specify the connector options using either the option() or options() method.

Step 4.Use the dbtable option to specify the table to which data is written.

Step 5.Use the save() method to specify the save mode for the content.

C.

Step 1.Use the write() method of the DataFrame to construct a DataFrameWriter.

Step 2.Specify SNOWFLAKE_SOURCE_NAME using the format() method.

Step 3.Specify the connector options using either the option() or options() method.

Step 4.Use the dbtable option to specify the table to which data is written.

Step 5.Use the mode() method to specify the save mode for the content.

(Correct)

D.

Step 1.Use the writer() method of the DataFrame to construct a DataFrameWriter.

Step 2.Specify SNOWFLAKE_SOURCE_NAME using the format() method.

Step 3.Use the dbtable option to specify the table to which data is written.

Step 4.Specify the connector options using either the option() or options() method.

Step 5.Use the save() method to specify the save mode for the content.
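The write()/format()/options()/mode()/save() chain can be sketched as follows, assuming the Snowflake Connector for Spark and an existing Spark DataFrame `df`; all connection values are placeholders, not a real account:

```python
# Sketch only -- requires a running Spark session with the
# Snowflake Connector for Spark on the classpath.
SNOWFLAKE_SOURCE_NAME = "net.snowflake.spark.snowflake"

sf_options = {
    "sfURL": "<account>.snowflakecomputing.com",
    "sfUser": "<user>",
    "sfPassword": "<password>",
    "sfDatabase": "<database>",
    "sfSchema": "<schema>",
    "sfWarehouse": "<warehouse>",
}

(df.write                              # Step 1: construct a DataFrameWriter
   .format(SNOWFLAKE_SOURCE_NAME)      # Step 2: specify the connector source
   .options(**sf_options)              # Step 3: connector options
   .option("dbtable", "MY_TABLE")      # Step 4: target table
   .mode("overwrite")                  # Step 5: save mode
   .save())
```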

Question 14

Which command is used to install Jupyter Notebook?

Options:

A.

pip install jupyter

B.

pip install notebook

C.

pip install jupyter-notebook

D.

pip install nbconvert

Question 15

Mark the incorrect statement regarding the usage of Snowflake Streams & Tasks.

Options:

A.

Snowflake automatically resizes and scales the compute resources for serverless tasks.

B.

Snowflake ensures only one instance of a task with a schedule (i.e. a standalone task or the root task in a DAG) is executed at a given time. If a task is still running when the next scheduled execution time occurs, then that scheduled time is skipped.

C.

Streams support repeatable read isolation.

D.

A standard-only stream tracks row inserts only.

Question 16

Which of the following are types of visualization used for data exploration in data science?

Options:

A.

Heat Maps

B.

Newton AI

C.

Feature Distribution by Class

D.

2D-Density Plots

E.

Sand Visualization

Question 17

A Data Scientist used streams in ELT (extract, load, transform) processes where new data inserted into a staging table is tracked by a stream. A set of SQL statements transforms and inserts the stream contents into a set of production tables. Raw data arrives in JSON format, but for analysis he needs to transform it into relational columns in the production tables. Which of the following data transformation SQL functions can he use to achieve this?

Options:

A.

He cannot apply transformations on stream table data.

B.

lateral flatten()

C.

METADATA$ACTION ()

D.

Transpose()
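In Snowflake SQL the flattening itself is done with `LATERAL FLATTEN` inside the transformation statement; as a purely illustrative analogue, here is a small Python function that performs the same expansion of a nested JSON array into relational rows (the record shapes and the helper name are made up):

```python
import json

def lateral_flatten(records, array_key):
    """Python analogue of Snowflake's LATERAL FLATTEN: expand a nested
    JSON array so each element becomes its own relational row."""
    rows = []
    for rec in records:
        for item in rec.get(array_key, []):
            # Carry the parent's scalar columns onto every child row.
            row = {k: v for k, v in rec.items() if k != array_key}
            row.update(item)
            rows.append(row)
    return rows

raw = json.loads('[{"id": 1, "orders": [{"sku": "A"}, {"sku": "B"}]}]')
rows = lateral_flatten(raw, "orders")
```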
