Oracle 1z0-1110-25 Oracle Cloud Infrastructure 2025 Data Science Professional Exam Practice Test

Demo: 47 questions
Total 158 questions

Oracle Cloud Infrastructure 2025 Data Science Professional Questions and Answers

Question 1

Which statement is true about origin management in Web Application Firewall (WAF)?

Options:

A.

Multiple origins can be defined

B.

Only a single origin can be active for a WAF

C.

Only statement B is true

D.

Both the statements are false

E.

Both the statements are true

F.

Only statement A is true

Question 2

As a data scientist for a hardware company, you have been asked to predict the revenue demand for the upcoming quarter. You develop a time series forecasting model to analyze the data. Select the correct sequence of steps to predict the revenue demand values for the upcoming quarter.

Options:

A.

Verify, prepare model, deploy, save, predict

B.

Predict, deploy, save, verify, prepare model

C.

Prepare model, deploy, verify, save, predict

D.

Prepare model, verify, save, deploy, predict

Question 3

You want to make your model more parsimonious to reduce the cost of collecting and processing data. You plan to do this by removing features that are highly correlated. You would like to create a heatmap that displays the correlation so that you can identify candidate features to remove. Which Accelerated Data Science (ADS) SDK method would be appropriate to display the correlation between Continuous and Categorical features?

Options:

A.

corr()

B.

correlation_ratio_plot()

C.

pearson_plot()

D.

cramersv_plot()

Question 4

What is a conda environment?

Options:

A.

A system that manages package dependencies

B.

A collection of kernels

C.

An open-source environment management system

D.

An environment deployment system on Oracle AI

Question 5

Which type of firewall is designed to protect against web application attacks, such as SQL injection and cross-site scripting?

Options:

A.

Stateful inspection firewall

B.

Web Application Firewall

C.

Incident firewall

D.

Packet filtering firewall

Question 6

You want to write a Python script to create a collection of different projects for your data science team. Which Oracle Cloud Infrastructure (OCI) Data Science interface would you use?

Options:

A.

The OCI Software Development Kit (SDK)

B.

OCI Console

C.

Command Line Interface (CLI)

D.

Mobile App

Question 7

Which of the following programming languages are most widely used by data scientists?

Options:

A.

C and C++

B.

Python, R, and SQL

C.

Java and JavaScript

Question 8

Which of these protects customer data at rest and in transit in a way that allows customers to meet their security and compliance requirements for cryptographic algorithms and key management?

Options:

A.

Security controls

B.

Customer isolation

C.

Data encryption

D.

Identity Federation

Question 9

You have an embarrassingly parallel or distributed batch job with a large amount of data running using Data Science Jobs. What would be the best approach to run the workload?

Options:

A.

Create a job in Data Science Jobs and then start the number of simultaneous job runs required for your workload

B.

Create a new job for every job run that you have to run in parallel, because the Data Science Jobs service can have only one job run per job

C.

Create the job in Data Science Jobs and start a job run. When it is done, start a new job run until you achieve the number of runs required

D.

Reconfigure the job run because Data Science Jobs does not support embarrassingly parallel workloads

Question 10

You are building a model and need input that represents data as morning, afternoon, or evening. However, the data contains a timestamp. What part of the Data Science lifecycle would you be in when creating the new variable?

Options:

A.

Model type selection

B.

Model validation

C.

Data access

D.

Feature engineering
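
The derived variable described in this question can be sketched in a few lines of Python. This is only an illustration: the function name `part_of_day` and the hour boundaries (morning before 12:00, afternoon before 17:00) are assumptions, not something the question specifies.

```python
from datetime import datetime


def part_of_day(ts: datetime) -> str:
    """Map a timestamp to a coarse label (hour boundaries are assumed)."""
    if 5 <= ts.hour < 12:
        return "morning"
    if 12 <= ts.hour < 17:
        return "afternoon"
    # Everything else, including late-night hours, is grouped as evening here.
    return "evening"


# Hypothetical sample timestamps, used only to exercise the function.
timestamps = [
    datetime(2025, 1, 6, 8, 30),
    datetime(2025, 1, 6, 13, 5),
    datetime(2025, 1, 6, 21, 45),
]
labels = [part_of_day(ts) for ts in timestamps]
print(labels)
```

A real dataset would probably need an explicit decision about night-time hours and time zones before a rule like this is applied.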

Question 11

You are a data scientist leveraging the Oracle Cloud Infrastructure (OCI) Language AI service for various types of text analyses. Which TWO capabilities can you utilize with this tool?

Options:

A.

Table extraction

B.

Punctuation correction

C.

Sentence diagramming

D.

Topic classification

E.

Sentiment analysis

Question 12

You are a data scientist working inside a notebook session and you attempt to pip install a package from a public repository that is not included in your conda environment. After running this command, you get a network timeout error. What might be missing from your networking configuration?

Options:

A.

FastConnect to an on-premises network

B.

Primary Virtual Network Interface Card (VNIC)

C.

NAT Gateway with public internet access

D.

Service Gateway with private subnet access

Question 13

You are working as a data scientist for a healthcare company. They decided to analyze the data to find patterns in a large volume of electronic medical records. You are asked to build a PySpark solution to analyze these records in a JupyterLab notebook. What is the order of recommended steps to develop a PySpark application in OCI Data Science?

Options:

A.

Launch a notebook session, configure core-site.xml, install a PySpark conda environment, develop your PySpark application, create a Data Flow application with the Accelerated Data Science (ADS) SDK

B.

Configure core-site.xml, install a PySpark conda environment, create a Data Flow application with the Accelerated Data Science (ADS) SDK, develop your PySpark application, launch a notebook session

C.

Install a Spark conda environment, configure core-site.xml, launch a notebook session, create a Data Flow application with the Accelerated Data Science (ADS) SDK, develop your PySpark application

D.

Launch a notebook session, install a PySpark conda environment, configure core-site.xml, develop your PySpark application, create a Data Flow application with the Accelerated Data Science (ADS) SDK

Question 14

Select two reasons why it is important to rotate encryption keys when using Oracle Cloud Infrastructure (OCI) Vault to store credentials or other secrets.

Options:

A.

Key rotation allows you to encrypt no more than five keys at a time

B.

Key rotation improves encryption efficiency

C.

Periodically rotating keys makes it easier to reuse keys

D.

Key rotation reduces risk if a key is ever compromised

E.

Periodically rotating keys limits the amount of data encrypted by one key version

Question 15

What is the first step in the data science process?

Options:

A.

Collecting data and preparing it for analysis

B.

Experimenting with and tuning different analytical models

C.

Defining an analytical hypothesis that could provide business value

D.

Working with data owners

Question 16

Which OCI service provides a scalable environment for developers and data scientists to run Apache Spark applications at scale?

Options:

A.

Data Science

B.

Anomaly Detection

C.

Data Labeling

D.

Data Flow

Question 17

You have just started as a data scientist at a healthcare company. You have been asked to analyze and improve a deep neural network model, which was built based on the electrocardiogram records of patients. There are no details about the model framework that was built. What would be the best way to find more details about the machine learning models inside the model catalog?

Options:

A.

Refer to the code inside the model

B.

Check for model taxonomy details

C.

Check for metadata tags

D.

Check for provenance details

Question 18

Which statement about resource principals is true?

Options:

A.

When you authenticate using a resource principal, you need to create and manage credentials to access OCI resources.

B.

A resource principal is not a secure way to authenticate to resources, compared to the OCI configuration and API key approach.

C.

The Data Science service does not provide authentication via a notebook session’s or job run’s resource principal to access other OCI resources.

D.

A resource principal is a feature of IAM that enables resources to be authorized principal actors.

Question 19

Which model has an open-source, open model format that allows you to run machine learning models on different platforms?

Options:

A.

PySpark

B.

PyTorch

C.

TensorFlow

D.

ONNX

Question 20

Six months ago, you created and deployed a model that predicts customer churn for a call centre. Initially, it was yielding quality predictions. However, over the last two months, users are questioning the credibility of the predictions. Which TWO methods would you employ to verify the accuracy of the model?

Options:

A.

Retrain the model

B.

Validate the model using recent data

C.

Drift monitoring

D.

Redeploy the model

E.

Operational monitoring

Question 21

As a data scientist, you are tasked with creating a model training job that is expected to take different hyperparameter values on every run. What is the most efficient way to set those parameters with Oracle Data Science Jobs?

Options:

A.

Create a new job every time you need to run your code and pass the parameters as environment variables

B.

Create a new job by setting the required parameters in your code and create a new job for every code change

C.

Create your code to expect different parameters either as environment variables or as command-line arguments, which are set on every job run with different values

D.

Create your code to expect different parameters as command-line arguments and create a new job every time you run the code
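
Several of the options above mention passing parameters as environment variables or command-line arguments. A minimal sketch of training code that accepts hyperparameters either way is shown below. The names `LEARNING_RATE`, `EPOCHS`, `--learning-rate`, and `--epochs` are hypothetical, and the sketch uses only the Python standard library rather than any Oracle API.

```python
import argparse
import os


def read_hyperparameters(argv=None, env=None):
    """Read hyperparameters; command-line arguments override environment values."""
    env = os.environ if env is None else env
    parser = argparse.ArgumentParser()
    # Environment variables (hypothetical names) supply the defaults.
    parser.add_argument("--learning-rate", type=float,
                        default=float(env.get("LEARNING_RATE", "0.01")))
    parser.add_argument("--epochs", type=int,
                        default=int(env.get("EPOCHS", "10")))
    args = parser.parse_args([] if argv is None else argv)
    return {"learning_rate": args.learning_rate, "epochs": args.epochs}


# Example: one value from the environment, one from the command line.
params = read_hyperparameters(["--epochs", "20"], env={"LEARNING_RATE": "0.05"})
print(params)
```

With this shape, the same script can be reused unchanged while the values are varied from outside the code.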

Question 22

Which step is unique to MLOps, as opposed to DevOps?

Options:

A.

Continuous deployment

B.

Continuous integration

C.

Continuous delivery

D.

Continuous training

Question 23

You are a data scientist using Oracle AutoML to produce a model and you are evaluating the score metric for the model. Which of the following TWO prevailing metrics would you use for evaluating a multiclass classification model?

Options:

A.

Recall

B.

Mean squared error

C.

F1 Score

D.

R-Squared

E.

Explained variance score

Question 24

You are using a custom application with third-party APIs to manage applications and data hosted in an Oracle Cloud Infrastructure (OCI) tenancy. Although your third-party APIs don’t support OCI’s signature-based authentication, you want them to communicate with OCI resources. Which authentication option must you use to ensure this?

Options:

A.

OCI username and password

B.

API Signing Key

C.

SSH Key Pair with 2048-bit algorithm

D.

Auth Token

Question 25

Which is NOT a part of Observability and Management Services?

Options:

A.

Event Services

B.

OCI Management Service

C.

Logging Analytics

D.

Logging

Question 26

You are a data scientist with a set of text and image files that need annotation, and you want to use Oracle Cloud Infrastructure (OCI) Data Labeling. Which of the following THREE annotation classes are supported by the tool?

Options:

A.

Object detection

B.

Named entity extraction

C.

Classification (single/multi-label)

D.

Key-point and landmark

E.

Polygonal segmentation

F.

Semantic segmentation

Question 27

You are attempting to save a model from a notebook session to the model catalog by using the Accelerated Data Science (ADS) SDK, with resource principal as the authentication signer, and you get a 404 authentication error. Which two should you look for to ensure permissions are set up correctly?

Options:

A.

The model artifact is saved to the block volume of the notebook session

B.

A dynamic group has rules that match the notebook sessions in its compartment

C.

The policy for your user group grants manage permissions for the model catalog in this compartment

D.

The policy for a dynamic group grants manage permissions for the model catalog in this compartment

E.

The networking configuration allows access to Oracle Cloud Infrastructure services through a Service Gateway

Question 28

Which activity is NOT a part of the machine learning life cycle?

Options:

A.

Database Management

B.

Model Deployment

C.

Modeling

D.

Data Access

Question 29

You have received machine learning model training code, without clear information about the optimal shape to run the training on. How would you proceed to identify the optimal compute shape for your model training that provides a balanced cost and processing time?

Options:

A.

Start with a smaller shape and monitor the job run metrics and time required to complete the model training. If the compute shape is not fully utilized, tune the model parameters, and rerun the job. Repeat the process until the shape resources are fully utilized.

B.

Start with the strongest compute shape that Jobs supports and monitor the job run metrics and time required to complete the model training. Tune the model so that it utilizes as much compute resources as possible, even at an increased cost.

C.

Start with a small shape and monitor the utilization metrics and time required to complete the model training. If the compute shape is fully utilized, change to compute that has more resources and rerun the job. Repeat the process until the processing time does not improve.

D.

Start with a random compute shape and monitor the utilization metrics and time required to finish the model training. Perform model training optimization and performance tests in advance to identify the right compute shape before running the model training as a job.

Question 30

You are creating an Oracle Cloud Infrastructure (OCI) Data Science job that will run on a recurring basis in a production environment. This job will pick up sensitive data from an Object Storage Bucket, train a model, and save it to the model catalog. How would you design the authentication mechanism for the job?

Options:

A.

Create a pre-authenticated request (PAR) for the Object Storage bucket and use that in the job code

B.

Use the resource principal of the job run as the signer in the job code, ensuring there is a dynamic group for this job run with appropriate access to Object Storage and the model catalog

C.

Package your personal OCI config file and keys in the job artifact

D.

Store your personal OCI config file and keys in the Vault, and access the Vault through the job run resource principal

Question 31

As a data scientist, you require a pipeline to train ML models. When can a pipeline run be initiated?

Options:

A.

Pipeline can be initiated once it is created.

B.

Pipeline can be initiated during the pipeline run state.

C.

Pipeline can be initiated after the active state.

D.

Pipeline can be initiated before the active state.

Question 32

Which statement about Oracle Cloud Infrastructure Data Science Jobs is true?

Options:

A.

Jobs provisions the infrastructure to run a process on-demand

B.

Jobs comes with a set of standard tasks that cannot be customized

C.

You must create and manage your own Jobs infrastructure

D.

You must use a single Shell/Bash or Python artifact to run a job

Question 33

What happens when a notebook session is deactivated?

Options:

A.

Compute cost increases due to frequent deactivation

B.

The data on boot volume is preserved

C.

The underlying compute instance stops

D.

The block volume attached to the notebook is permanently deleted

Question 34

Where do calls to stdout and stderr from score.py go in the model deployment?

Options:

A.

The file that was defined for them on the virtual machine (VM)

B.

The OCI Console

C.

The OCI Cloud Shell, which can be accessed from the console

D.

The predict log in the Oracle Cloud Infrastructure (OCI) Logging service as defined in the deployment

Question 35

You have created a conda environment in your notebook session. This is the first time you are working with published conda environments. You have also created an Object Storage bucket with permission to manage the bucket. Which TWO commands are required to publish the conda environment?

Options:

A.

odsc conda publish --slug

B.

odsc conda list --override

C.

odsc conda init --bucket_namespace --bucket_name

D.

odsc conda create --file manifest.yaml

E.

conda activate /home/datascience/conda/

Question 36

Which statement about logs for Oracle Cloud Infrastructure Jobs is true?

Options:

A.

Each job run sends outputs to a single log for that job

B.

Integrating data science jobs resources with logging is mandatory

C.

All stdout and stderr are automatically stored when automatic log creation is enabled

D.

Logs are automatically deleted when the job and job run are deleted

Question 37

Which of the following best describes the principal goal of data science?

Options:

A.

To collect and archive exhaustive datasets from various source systems for corporate record-keeping uses.

B.

To mine and analyze large amounts of data in order to uncover information that can be used for operational improvements and business gains.

C.

To collect and prepare data for use as part of analytics applications.

D.

Data science is focused on output of the analysis.

Question 38

You want to create a user group for a team of external data science consultants. The consultants should only have the ability to see Data Science resource details but not have the ability to create, delete, or update Data Science resources. What verb should you write in the policy?

Options:

A.

Use

B.

Inspect

C.

Manage

D.

Read

Question 39

Which of these options allows the sharing and loading back of ML models into a notebook session?

Options:

A.

Model provenance

B.

Model taxonomy

C.

Model deployment

D.

Model catalog

Question 40

You have a dataset with fewer than 1000 observations, and you are using Oracle AutoML to build a classifier. While visualizing the results of each stage of the Oracle AutoML pipeline, you notice that no visualization has been generated for one of the stages. Which stage is not visualized?

Options:

A.

Feature selection

B.

Algorithm selection

C.

Adaptive sampling

D.

Hyperparameter tuning

Question 41

What is feature engineering in machine learning used for?

Options:

A.

To perform parameter tuning

B.

To interpret ML models

C.

To transform existing features into new ones

D.

To help understand the dataset features

Question 42

Which statement accurately describes an aspect of machine learning models?

Options:

A.

Model performance degrades over time due to changes in data.

B.

Static predictions become increasingly accurate over time.

C.

Data models are more static and generally require fewer updates than software code.

D.

A high-quality model will not need to be retrained as new information is received.

Question 43

As a data scientist, you are trying to automate a machine learning (ML) workflow and have decided to use Oracle Cloud Infrastructure (OCI) AutoML Pipeline. Which THREE are part of the AutoML Pipeline?

Options:

A.

Feature Selection

B.

Adaptive Sampling

C.

Model Deployment

D.

Feature Extraction

E.

Algorithm Selection

Question 44

Which of these is a unique feature of the published conda environment?

Options:

A.

Provides a comprehensive environment to solve business use cases

B.

Provides availability on notebook session reactivation

C.

Allows you to save the conda environment to an Object Storage Bucket

D.

Allows you to save the conda environment in a block volume

Question 45

You are working in your notebook session and find that your notebook session does not have enough compute CPU and memory for your workload. How would you scale up your notebook session without losing your work?

Options:

A.

Deactivate your notebook session, provision a new notebook session on a larger compute shape, and recreate all your file changes

B.

Download your files and data to your local machine, delete your notebook session, provision a new notebook session on a larger compute shape, and upload your files from your local machine to the new notebook session

C.

Ensure your files and environments are written to the block volume storage under the /home/datascience directory, deactivate the notebook session, and activate the notebook with a larger compute shape selected

D.

Create a temporary bucket in Object Storage, write all your files and data to Object Storage, delete the notebook session, provision a new notebook session on a larger compute shape, and copy your files and data from your temporary bucket to your new notebook session

Question 46

What is a common maxim about data scientists?

Options:

A.

They spend 80% of their time finding and preparing data and 20% analyzing it.

B.

They spend 80% of their time analyzing data and 20% finding and preparing it.

C.

They spend 80% of their time on failed analytics projects and 20% doing useful work.

Question 47

What is the primary difference between a data scientist and a data engineer?

Options:

A.

A data engineer collects and prepares data, and a data scientist then analyzes it.

B.

A data engineer analyzes data after a data scientist collects and prepares it.

C.

A data engineer builds data pipelines and helps prepare data, while a data scientist is responsible for data collection, preparation, and analysis.

D.

A data engineer creates data flows to be used as templates by the data analyst.
