CompTIA DA0-002 CompTIA Data+ Exam (2025) Exam Practice Test

Demo: 36 questions
Total 121 questions

CompTIA Data+ Exam (2025) Questions and Answers

Question 1

A data analyst creates a report, and some of the fields are empty. Which of the following conditions should the analyst add to a query to provide a list of all the records with empty fields?

Options:

A.

WHERE [ColumnName] = NULL

B.

WHERE [ColumnName] IS NULL

C.

WHERE [ColumnName] IS NOT NULL

D.

WHERE [ColumnName] = 'NULL'
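
One way to compare the four conditions is to run each against a small table. The sketch below uses Python's built-in `sqlite3` module; the table name `report`, the column `region`, and the sample rows are assumptions made up for illustration, and other database engines may differ in detail.

```python
import sqlite3

# Hypothetical table and column names, used only for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE report (id INTEGER, region TEXT)")
conn.executemany(
    "INSERT INTO report VALUES (?, ?)",
    [(1, "East"), (2, None), (3, "NULL"), (4, None)],  # None is stored as SQL NULL
)


def matching_ids(condition):
    """Return the ids of rows that satisfy the given WHERE condition."""
    rows = conn.execute(f"SELECT id FROM report WHERE {condition} ORDER BY id")
    return [row[0] for row in rows]


equals_null = matching_ids("region = NULL")        # comparison with NULL is never true
is_null = matching_ids("region IS NULL")           # rows whose field is empty
is_not_null = matching_ids("region IS NOT NULL")   # rows whose field has a value
equals_text = matching_ids("region = 'NULL'")      # rows holding the literal text 'NULL'

for name, ids in [
    ("= NULL", equals_null),
    ("IS NULL", is_null),
    ("IS NOT NULL", is_not_null),
    ("= 'NULL'", equals_text),
]:
    print(name, ids)
```

Row 3 stores the four-character string `'NULL'`, which is a real value and not an empty field, so it helps separate the string comparison from the null test.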

Question 2

Which of the following data repositories stores unaltered data?

Options:

A.

Data lake

B.

Data warehouse

C.

Data table

D.

Data factory

Question 3

The director of operations at a power company needs data to help identify where company resources should be allocated in order to monitor activity for outages and restoration of power in the entire state. Specifically, the director wants to see the following:

* County outages

* Status

* Overall trend of outages

INSTRUCTIONS:

Please select each visualization to fit the appropriate space on the dashboard and choose an appropriate color scheme. Once you have selected all visualizations, please select the appropriate titles and labels, if applicable. Titles and labels may be used more than once.

If at any time you would like to bring back the initial state of the simulation, please click the Reset All button.

Options:

Question 4

A data analyst is following up on a recent, company-wide data audit of customer invoice data. Which of the following is the best option for the analyst to use?

Options:

A.

PCI DSS

B.

GDPR

C.

ISO

D.

PII

Question 5

A sales manager wants a dashboard that shows sales aggregated by region and identifies high-volume sales by salesperson per region. Which of the following communication techniques best displays this information?

Options:

A.

Defined parameters

B.

Filter options

C.

Level of detail

D.

User persona

Question 6

While interacting with a data set, a data analyst learns that copies of the data are distributed across several data repositories around the globe. Which of the following concepts best describes this practice?

Options:

A.

Redundancy

B.

High availability

C.

Replication

D.

Duplication

Question 7

A data analyst needs to join together a table data source and web API data source using Python. Which of the following is the best way to accomplish this task?

Options:

A.

Convert the data from the API and database to a varchar format and convert them to pandas DataFrames that are then merged together.

B.

Convert the data from the API and database to a JSON format and convert them to pandas DataFrames that are then merged together.

C.

Convert the data from the API and database to a TXT format and convert them to pandas DataFrames that are then merged together.

D.

Convert the data from the API and database to a string format and convert them to pandas DataFrames that are then merged together.
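
The scenario can be sketched with pandas as follows. This is a minimal illustration under stated assumptions: the API response is replaced by a hard-coded JSON string, the table lives in an in-memory SQLite database, and every name (`customers`, `customer_id`, `tier`) is hypothetical.

```python
import json
import sqlite3

import pandas as pd

# Stand-in for a web API response body (hypothetical payload, no network call).
api_payload = '[{"customer_id": 1, "tier": "gold"}, {"customer_id": 2, "tier": "silver"}]'
api_df = pd.DataFrame(json.loads(api_payload))

# Stand-in for the table data source (hypothetical schema).
conn = sqlite3.connect(":memory:")
conn.row_factory = sqlite3.Row  # lets each row be converted to a dict
conn.execute("CREATE TABLE customers (customer_id INTEGER, name TEXT)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?)",
    [(1, "Ana"), (2, "Ben"), (3, "Cy")],
)
rows = conn.execute("SELECT customer_id, name FROM customers").fetchall()

# Convert the rows to JSON-style records (a list of dicts), then to a DataFrame.
db_records = [dict(row) for row in rows]
db_df = pd.DataFrame(db_records)

# Merge the two DataFrames on their shared key.
merged = db_df.merge(api_df, on="customer_id", how="inner")
print(merged)
```

Both sources end up as lists of records with named fields, so the shared key column keeps its name and numeric type through to the merge.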

Question 8

A data analyst needs to modify a dashboard that was created by another employee. Upon opening the dashboard, the analyst notices that the information is not loading properly. Which of the following should the analyst do to troubleshoot this error?

Options:

A.

Review the data layer and data source.

B.

Validate that the database is up-to-date.

C.

Check that the program is updated to the latest version.

D.

Ensure the correct filters are displaying on the dashboard.

Question 9

Software end users are happy with the quality of product support provided. However, they frequently raise concerns about the long wait time for resolutions. An IT manager wants to improve the current support process. Which of the following should the manager use for this review?

Options:

A.

Infographic

B.

KPI

C.

Survey

D.

UAT

Question 10

A company's analytics manager wants all reports to be delivered once every seven days. Which of the following is the best delivery method?

Options:

A.

Recurring

B.

Ad hoc

C.

Custom

D.

Snapshot

Question 11

The human resources department wants to understand the relationship between the ages and incomes of all employees. Which of the following graphics is the most appropriate to present the analysis?

Options:

A.

Scatter plot

B.

Area plot

C.

Bar chart

D.

Pie chart

Question 12

A data analyst is analyzing the following dataset:

Transaction Date | Quantity | Item           | Item Price
12/12/12         | 11       | USB Cords      | 9.99
11/11/11         | 3        | Charging Block | 8.89
10/10/10         | 5        | Headphones     | 50.15

Which of the following methods should the analyst use to determine the total cost for each transaction?

Options:

A.

Parsing

B.

Scaling

C.

Compressing

D.

Deriving
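
As a small worked illustration, a total cost per transaction can be computed from the quantity and item price columns of the dataset above. The sketch assumes total cost means quantity multiplied by item price, and uses `Decimal` to avoid floating-point rounding on currency values.

```python
from decimal import Decimal

# Rows transcribed from the dataset above:
# (transaction date, quantity, item, item price).
transactions = [
    ("12/12/12", 11, "USB Cords", Decimal("9.99")),
    ("11/11/11", 3, "Charging Block", Decimal("8.89")),
    ("10/10/10", 5, "Headphones", Decimal("50.15")),
]

# Create a new field (total cost) from two existing fields.
totals = {item: quantity * price for _, quantity, item, price in transactions}

for item, total in totals.items():
    print(f"{item}: {total}")
```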

Question 13

A user needs a report that shows the main causes of customer churn rate in a three-year period. Which of the following methods provides this information?

Options:

A.

Inferential

B.

Descriptive

C.

Prescriptive

D.

Predictive

Question 14

Which of the following best describes a characteristic of a Boolean?

Options:

A.

Must be a character

B.

Must be two values only

C.

Must be numeric

D.

Must be system-created

Question 15

A data analyst receives an email from the IT department about renewing the company password, and the analyst follows the password reset link as required. Later in the week, the analyst receives the following notification when running a recurring analysis that connects to the database:

Log-in failed for user ‘

Which of the following is most likely the reason for this issue?

Options:

A.

The company changed its database authentication method.

B.

The password expiration process locked the account.

C.

The analyst did not change the password used to launch the report.

D.

The company is experiencing issues with password replication.

Question 16

Which of the following is the best reason for a company to use a CSV file to share data instead of an Excel file?

Options:

A.

CSV files can store different types of encoding.

B.

CSV files are not vendor-specific.

C.

CSV files are smaller in size.

D.

CSV files are easier to change in text editors.

Question 17

A table contains several rows of cellular numbers with call timestamps, call durations, called numbers, and carriers of the called number. Which of the following allows a data analyst to sort the cellular numbers based on the carriers of the called numbers and include the total call durations?

Options:

A.

SELECT cellular_number, called_number_carrier, SUM(call_duration) FROM calls GROUP BY cellular_number ORDER BY called_number_carrier;

B.

SELECT cellular_number, SUM(call_duration) FROM calls GROUP BY call_duration ORDER BY called_number_carrier;

C.

SELECT cellular_number, called_number_carrier, SUM(call_duration) FROM calls GROUP BY cellular_number, called_number_carrier ORDER BY called_number_carrier;

D.

SELECT cellular_number, called_number_carrier, SUM(call_duration) FROM calls GROUP BY call_duration ORDER BY called_number_carrier;
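
To experiment with grouping and ordering, a query can be run against sample data with Python's `sqlite3` module. The column names for the timestamp and called number, and all sample rows, are assumptions. The secondary sort on `cellular_number` is added here only to make the row order deterministic. Note that SQLite tolerates non-aggregated columns missing from `GROUP BY`, while many other engines reject such queries.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE calls ("
    "cellular_number TEXT, call_timestamp TEXT, call_duration INTEGER, "
    "called_number TEXT, called_number_carrier TEXT)"
)
# Hypothetical sample rows.
conn.executemany(
    "INSERT INTO calls VALUES (?, ?, ?, ?, ?)",
    [
        ("555-0001", "2025-01-01 09:00", 60, "555-1000", "CarrierA"),
        ("555-0001", "2025-01-01 10:00", 30, "555-1001", "CarrierA"),
        ("555-0001", "2025-01-02 11:00", 45, "555-2000", "CarrierB"),
        ("555-0002", "2025-01-02 12:00", 10, "555-1000", "CarrierA"),
    ],
)

# Every non-aggregated column in the SELECT list also appears in GROUP BY.
totals = conn.execute(
    """
    SELECT cellular_number, called_number_carrier, SUM(call_duration)
    FROM calls
    GROUP BY cellular_number, called_number_carrier
    ORDER BY called_number_carrier, cellular_number
    """
).fetchall()

for row in totals:
    print(row)
```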

Question 18

A data analyst wants to use the following tables to find all the customers who have not placed an order:

Customers table

ID | Name | Address

Products table

ID | Name | Customer_ID

Which of the following SQL statements is the best way to accomplish this task?

Options:

A.

SELECT * FROM CUSTOMERS AS C LEFT JOIN PRODUCTS AS P ON C.ID = P.Customer_ID WHERE P.Customer_ID IS NULL

B.

SELECT * FROM CUSTOMERS AS C INNER JOIN PRODUCTS AS P ON C.ID = C.ID WHERE COUNT(P.) = 0

C.

SELECT * FROM PRODUCTS AS P INNER JOIN CUSTOMERS AS C ON P.Customer_ID = C.ID WHERE (SELECT COUNT(P.) = 0)

D.

SELECT * FROM PRODUCTS AS P LEFT JOIN CUSTOMERS AS C ON P.Customer_ID = C.ID WHERE P.Customer_ID IS NOT NULL
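
The pattern of finding rows in one table that have no match in another can be tried out on sample data. The sketch below uses Python's `sqlite3` module with the two table layouts from the question; the rows are invented for illustration, and the query selects two named columns instead of `*` to keep the output short.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Customers (ID INTEGER, Name TEXT, Address TEXT)")
conn.execute("CREATE TABLE Products (ID INTEGER, Name TEXT, Customer_ID INTEGER)")

# Hypothetical sample rows: customer 2 has no matching row in Products.
conn.executemany(
    "INSERT INTO Customers VALUES (?, ?, ?)",
    [(1, "Ana", "1 Elm St"), (2, "Ben", "2 Oak St"), (3, "Cy", "3 Pine St")],
)
conn.executemany(
    "INSERT INTO Products VALUES (?, ?, ?)",
    [(10, "Lamp", 1), (11, "Desk", 1), (12, "Chair", 3)],
)

# A LEFT JOIN keeps every customer; unmatched customers get NULLs
# in the Products columns, which the WHERE clause then selects.
no_orders = conn.execute(
    """
    SELECT C.ID, C.Name
    FROM Customers AS C
    LEFT JOIN Products AS P ON C.ID = P.Customer_ID
    WHERE P.Customer_ID IS NULL
    ORDER BY C.ID
    """
).fetchall()

print(no_orders)
```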

Question 19

Given the following tables:

Individual table

ID | FirstName | LastName
1  | John      | Doe

Output

ID | FullName
1  | JohnDoe

Which of the following is the best option to display output from FirstName and LastName as FullName?

Options:

A.

Concatenate

B.

Filter

C.

Join

D.

Group
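
For reference, the Output table above can be reproduced from the Individual table with a short script. This is a sketch using Python's `sqlite3` module and SQLite's `||` string operator; other database engines may provide a differently named function or operator for the same purpose.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Individual (ID INTEGER, FirstName TEXT, LastName TEXT)")
conn.execute("INSERT INTO Individual VALUES (1, 'John', 'Doe')")

# Combine the two name columns into one output column with no separator,
# matching the FullName value shown in the Output table.
output = conn.execute(
    "SELECT ID, FirstName || LastName AS FullName FROM Individual"
).fetchall()

print(output)
```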

Question 20

A data analyst is creating a report about cars sold within the last twelve months. The current data set only contains information from the last ten months. Which of the following is the most appropriate way to accomplish this task?

Options:

A.

Join the records from the last two months of the previous year and append them to the data table.

B.

Parse the records from the last two months of the previous year and append them to the data table.

C.

Merge the records from the last two months of the previous year and append them to the data table.

D.

Extract the records from the last two months of the previous year and append them to the data table.

Question 21

Which of the following data repositories stores unstructured and structured data?

Options:

A.

Data store

B.

Data silo

C.

Data mart

D.

Data lake

Question 22

A data analyst learns that a report detailing employee sales is reflecting sales only for the current month. Which of the following is the most likely cause?

Options:

A.

Lack of permissions

B.

An error in SQL code

C.

Report refresh failure

D.

Connectivity issues

Question 23

A data analyst encounters an issue with new software and the code they are using with it. The analyst includes print statements in the code to try to identify the issue, without success. An informal peer review of the code also produces the same result. The analyst confirms that the software is updated to the latest version and compatible with the code. Which of the following troubleshooting steps should the analyst take next?

Options:

A.

Use the old software and preexisting code, since both were functional.

B.

Contact the IT department and inform them that the software has a bug.

C.

Escalate to the department manager and ask for assistance.

D.

Research the issue online and see if a solution is available.

Question 24

The sales department wants to include the composition of total sales amounts across all three sales channels in a report. Given the following sample sales table:

Sales channel | Month     | Sales (million $)
Digital       | January   | 135
Store         | February  | 145
Online        | March     | 165
Store         | April     | 200
Store         | May       | 125
Online        | June      | 155
Digital       | July      | 120
Online        | August    | 145
Digital       | September | 160

Which of the following visualizations is the most appropriate?

Options:

A.

Pivot table

B.

Pie chart

C.

KPI card

D.

Box plot

Question 25

Which of the following best enables the retrieval and manipulation of data that is stored in a relational database?

Options:

A.

XML

B.

SQL

C.

Excel

D.

JavaScript

Question 26

Which of the following tables holds relational keys and numeric values?

Options:

A.

Fact

B.

Graph

C.

Dimensional

D.

Transactional

Question 27

A marketing firm wants to find the average age of its consumers to better promote its products. Given the following dataset:

Name   | Date of birth | Age
Jane   | March 24      | 34
John   | July 17       | 11
Joe    | November 29   | 29
Ann    | December 13   | 14
Robert | December 14   | 63

Which of the following is the mean of the consumer ages?

Options:

A.

29

B.

36

C.

40

D.

63
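
The arithmetic can be checked with Python's standard `statistics` module. The ages below are transcribed from the dataset as it appears on this page; with those five values the script reports a mean of 30.2 and a median of 29, so it is worth comparing the listed ages against the original exam material before relying on any single option.

```python
import statistics

# Ages transcribed from the dataset above (Jane, John, Joe, Ann, Robert).
ages = [34, 11, 29, 14, 63]

# Mean: sum of the values divided by the number of values (151 / 5).
mean_age = statistics.mean(ages)

# Median: middle value after sorting (11, 14, 29, 34, 63).
median_age = statistics.median(ages)

print("mean:", mean_age)
print("median:", median_age)
```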

Question 28

A data professional wants to identify all customers who made a purchase in January. Given the following table:

CustomerID | Month   | Sales
0001       | January | 13000
0002       | March   | 10000
0003       | April   | 23000
0004       | May     | 10000

Which of the following types of functions should the professional use to flag the customers?

Options:

A.

Statistical

B.

Logical

C.

Mathematical

D.

Date
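
Flagging rows usually means attaching a true/false value to each record based on a test. A minimal sketch in Python, using the rows from the table above and a plain equality test on the Month column:

```python
# Rows transcribed from the table above: (CustomerID, Month, Sales).
rows = [
    ("0001", "January", 13000),
    ("0002", "March", 10000),
    ("0003", "April", 23000),
    ("0004", "May", 10000),
]

# Evaluate a condition per row and store the resulting True/False flag.
january_flags = {customer_id: month == "January" for customer_id, month, _ in rows}

print(january_flags)
```

The Month column holds month names as text, so the test here is a string comparison and no date parsing is involved.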

Question 29

A developer builds an online survey that requires all questions to have an answer. Which of the following inconsistencies does this setting prevent?

Options:

A.

Missing values

B.

Duplication

C.

Data corruption

D.

Completeness

Question 30

A company has a document that includes the names of key metrics and the standard for how those metrics are calculated company-wide. Which of the following describes this documentation?

Options:

A.

Data dictionary

B.

Data explainability report

C.

Data lineage

D.

Data flow diagram

Question 31

A data analyst needs to identify outliers from a given dataset. Which of the following visualizations is the best way to identify outliers?

Options:

A.

Box plot

B.

Scatter plot

C.

Gantt chart

D.

Waterfall chart

Question 32

A data analyst pulls a table similar to the following one:

ID | Type      | TypeID      | Phone
1  | Full Time | Full Time 1 | Mobile
2  | Part Time | Part Time 2 | Work
3  | Full Time | Full Time 3 | Mobile

Which of the following best explains the data issue with TypeID?

Options:

A.

Redundancy

B.

Outlier

C.

Missing data

D.

Duplication

Question 33

Which of the following best explains the purpose of data lineage?

Options:

A.

To see the steps and path of data flow through different systems

B.

To better understand the granularity of data variable relationships

C.

To track data transformations from acquisition through reporting

D.

To look up data definitions, ensuring consistent use across business units

Question 34

A company reports on seven years of data in a sales dashboard. The dashboard pulls from a sales database that has 30 years of data. The dashboard performance is slow. Which of the following is the best way to improve the dashboard's performance?

Options:

A.

Performing a code review

B.

Checking network connectivity

C.

Filtering to include only relevant data

D.

Adding more RAM and rerunning

Question 35

Due to new reporting requirements, a data analyst must add new classification codes to historical data. Which of the following is the best technique for this task?

Options:

A.

Append

B.

Binning

C.

Parsing

D.

Union

Question 36

A data analyst creates a report that identifies the middle 50% of the collected data. Which of the following best describes the analyst's findings?

Options:

A.

Interquartile range

B.

The difference between mode and median

C.

Mean variance

D.

Skewness from the slope
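
To make "the middle 50% of the data" concrete, the quartiles of a small sample can be computed with Python's standard `statistics` module (Python 3.8 or later). The sample values are invented for illustration, and `statistics.quantiles` uses its default "exclusive" method here; other tools may use a different quartile convention and give slightly different cut points.

```python
import statistics

# Hypothetical sample, already sorted for readability.
data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]

# n=4 returns the three cut points that split the data into quarters.
q1, q2, q3 = statistics.quantiles(data, n=4)

# The span between the first and third quartiles covers the middle 50%.
spread = q3 - q1
middle_half = [value for value in data if q1 <= value <= q3]

print("Q1:", q1, "Q3:", q3, "spread:", spread)
print("middle half:", middle_half)
```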
