Questions for the D-DS-FN-23 were updated on : Nov 21 ,2025
What is the primary role of a business intelligence analyst on an analytics project?
C
You build a decision tree to classify five different types of customers based on their browsing history
from a sample of 500. The resulting decision tree has 17 layers. One of the leaf nodes has only three
customers.
What do you conclude?
D
What are three built-in data types in the R programming language?
A
Refer to Exhibit.
Refer to the exhibit, which shows pairwise counts for items purchased together.
Consider the following association rule: Milk -> Eggs
What is value of the lift?
A
Which component of a final presentation provides a succinct overview of the business situation that
was the impetus to initiate the project?
C
When building a K-means clustering model, you notice that the clusters did not segment on variables
that you expected. What should you do?
D
What is a benefit of Spark in-memory data processing as opposed to using MapReduce?
A
In which programming language is Hadoop written?
C
When should you consider using multinomial logistic regression over binary logistic regression?
C
Which SQL set operator returns rows that exist in the first SELECT statement answer set but not in the
second SELECT statement?
A
What are good reasons to develop a naïve Bayes classifier model?
D
Refer to Exhibit.
Refer to the exhibit.
To predict whether or not a customer will renew their annual property insurance policy, an insurance
company built and operationalized a naïve Bayes classification model. In the model, there are two
class labels, renewal and non-renewal, that are assigned to each customer based on their attributes.
A subset of the key attributes, their values, and corresponding conditional probabilities are provided
in the exhibit.
A customer has the following attributes:
● Age is greater than 65 years
● Owns their own home
● Renewal month is August
If 20% of customers do not renew the police every year, what is the score for a renewal in the naïve
Bayesian model for the customer described above?
D
In addition to quantitative and technical skills, what is a key aspect of the profile of a data scientist?
C
MapReduce is designed to process data in which way?
A
In the data preparation phase of the data analytics lifecycle, what does the term “data conditioning”
refer to?
D