Which statement is true about origin management in Web Application Firewall (WAF)?
As a data scientist for a hardware company, you have been asked to predict the revenue demand for the upcoming quarter. You develop a time series forecasting model to analyze the data. Select the correct sequence of steps to predict the revenue demand values for the upcoming quarter.
You want to make your model more parsimonious to reduce the cost of collecting and processing data. You plan to do this by removing features that are highly correlated. You would like to create a heatmap that displays the correlation so that you can identify candidate features to remove. Which Accelerated Data Science (ADS) SDK method would be appropriate to display the correlation between Continuous and Categorical features?
What is a conda environment?
Which type of firewalls are designed to protect against web application attacks, such as SQL injection and cross-site scripting?
You want to write a Python script to create a collection of different projects for your data science team. Which Oracle Cloud Infrastructure (OCI) Data Science interface would you use?
Which of the following programming languages are most widely used by data scientists?
Which of these protects customer data at rest and in transit in a way that allows customers to meet their security and compliance requirements for cryptographic algorithms and key management?
You have an embarrassingly parallel or distributed batch job with a large amount of data running using Data Science Jobs. What would be the best approach to run the workload?
You are building a model and need input that represents data as morning, afternoon, or evening. However, the data contains a timestamp. What part of the Data Science lifecycle would you be in when creating the new variable?
You are a data scientist leveraging the Oracle Cloud Infrastructure (OCI) Language AI service for various types of text analyses. Which TWO capabilities can you utilize with this tool?
You are a data scientist working inside a notebook session and you attempt to pip install a package from a public repository that is not included in your conda environment. After running this command, you get a network timeout error. What might be missing from your networking configuration?
You are working as a data scientist for a healthcare company. They decided to analyze the data to find patterns in a large volume of electronic medical records. You are asked to build a PySpark solution to analyze these records in a JupyterLab notebook. What is the order of recommended steps to develop a PySpark application in OCI Data Science?
Select two reasons why it is important to rotate encryption keys when using Oracle Cloud Infrastructure (OCI) Vault to store credentials or other secrets.
What is the first step in the data science process?
Which OCI service provides a scalable environment for developers and data scientists to run Apache Spark applications at scale?
You have just started as a data scientist at a healthcare company. You have been asked to analyze and improve a deep neural network model, which was built based on the electrocardiogram records of patients. There are no details about the model framework that was built. What would be the best way to find more details about the machine learning models inside the model catalog?
Which statement about resource principals is true?
Which model has an open-source, open model format that allows you to run machine learning models on different platforms?
Six months ago, you created and deployed a model that predicts customer churn for a call centre. Initially, it was yielding quality predictions. However, over the last two months, users are questioning the credibility of the predictions. Which TWO methods would you employ to verify the accuracy of the model?
As a data scientist, you are tasked with creating a model training job that is expected to take different hyperparameter values on every run. What is the most efficient way to set those parameters with Oracle Data Science Jobs?
Which step is unique to MLOps, as opposed to DevOps?
You are a data scientist using Oracle AutoML to produce a model and you are evaluating the score metric for the model. Which of the following TWO prevailing metrics would you use for evaluating a multiclass classification model?
You are using a custom application with third-party APIs to manage application and data hosted in an Oracle Cloud Infrastructure (OCI) tenancy. Although your third-party APIs don’t support OCI’s signature-based authentication, you want them to communicate with OCI resources. Which authentication option must you use to ensure this?
Which is NOT a part of Observability and Management Services?
You are a data scientist with a set of text and image files that need annotation, and you want to use Oracle Cloud Infrastructure (OCI) Data Labeling. Which of the following THREE annotation classes are supported by the tool?
You are attempting to save a model from a notebook session to the model catalog by using the Accelerated Data Science (ADS) SDK, with resource principal as the authentication signer, and you get a 404 authentication error. Which two should you look for to ensure permissions are set up correctly?
Which activity is NOT a part of the machine learning life cycle?
You have received machine learning model training code, without clear information about the optimal shape to run the training on. How would you proceed to identify the optimal compute shape for your model training that provides a balanced cost and processing time?
You are creating an Oracle Cloud Infrastructure (OCI) Data Science job that will run on a recurring basis in a production environment. This job will pick up sensitive data from an Object Storage Bucket, train a model, and save it to the model catalog. How would you design the authentication mechanism for the job?
As a data scientist, you require a pipeline to train ML models. When can a pipeline run be initiated?
Which statement about Oracle Cloud Infrastructure Data Science Jobs is true?
What happens when a notebook session is deactivated?
Where do calls to stdout and stderr from score.py go in the model deployment?
You have created a conda environment in your notebook session. This is the first time you are working with published conda environments. You have also created an Object Storage bucket with permission to manage the bucket. Which TWO commands are required to publish the conda environment?
Which statement about logs for Oracle Cloud Infrastructure Jobs is true?
Which of the following best describes the principal goal of data science?
You want to create a user group for a team of external data science consultants. The consultants should only have the ability to see Data Science resource details but not have the ability to create, delete, or update Data Science resources. What verb should you write in the policy?
Which of these options allow the sharing and loading back of ML models into a notebook session?
You have a dataset with fewer than 1000 observations, and you are using Oracle AutoML to build a classifier. While visualizing the results of each stage of the Oracle AutoML pipeline, you notice that no visualization has been generated for one of the stages. Which stage is not visualized?
What is feature engineering in machine learning used for?
Which statement accurately describes an aspect of machine learning models?
As a data scientist, you are trying to automate a machine learning (ML) workflow and have decided to use Oracle Cloud Infrastructure (OCI) AutoML Pipeline. Which THREE are part of the AutoML Pipeline?
Which of these is a unique feature of the published conda environment?
You are working in your notebook session and find that your notebook session does not have enough compute CPU and memory for your workload. How would you scale up your notebook session without losing your work?
What is a common maxim about data scientists?
What is the primary difference between a data scientist and a data engineer?