**Data Analytics MCQ with Answers**

**1. A ____ is a decision support tool that uses a tree-like graph or model of decisions and their possible consequences, including chance event outcomes, resource costs, and utility.**

- Decision tree
- Graphs
- Trees
- Neural Networks

Decision tree

**2. What is a decision tree?**

- Flowchart
- Structure in which each internal node represents a test on an attribute, each branch represents an outcome of the test, and each leaf node represents a class label
- Flowchart and structure in which each internal node represents a test on an attribute, each branch represents an outcome of the test, and each leaf node represents a class label
- None of the above

Flowchart and structure in which each internal node represents a test on an attribute, each branch represents an outcome of the test, and each leaf node represents a class label

**3. Decision Trees can be used for Classification Tasks.**

- TRUE
- FALSE

TRUE

**4. Which of the following are decision tree nodes?**

- Decision Nodes
- End Nodes
- Chance Nodes
- All of the above

All of the above

**5. Decision Nodes are represented by __**

- Disks
- Squares
- Circles
- Triangles

Squares

**6. Chance Nodes are represented by __**

- Disks
- Squares
- Circles
- Triangles

Circles

**7. End Nodes are represented by __**

- Disks
- Squares
- Circles
- Triangles

Triangles

**8. Which of the following are advantages of decision trees?**

- Possible scenarios can be added
- They use a white-box model, so a given result can be explained
- Worst, best, and expected values can be determined for different scenarios
- All of the above

All of the above
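
As an illustration of the "worst, best, and expected values" advantage, here is a minimal sketch that summarizes the chance outcomes under each decision. The decision names, probabilities, and payoffs are hypothetical and chosen only for the example.

```python
# Each decision leads to a chance node: a list of (probability, payoff) pairs.
# All names and numbers here are hypothetical.
scenarios = {
    "launch": [(0.6, 100.0), (0.4, -50.0)],
    "wait": [(1.0, 10.0)],
}

def summarize(outcomes):
    """Return (worst, best, expected) payoff for one chance node."""
    payoffs = [payoff for _, payoff in outcomes]
    expected = sum(prob * payoff for prob, payoff in outcomes)
    return min(payoffs), max(payoffs), expected

for decision, outcomes in scenarios.items():
    worst, best, expected = summarize(outcomes)
    print(f"{decision}: worst={worst}, best={best}, expected={expected:.1f}")
```

Adding a scenario only means adding another entry to `scenarios`, which is the sense in which possible scenarios can be added.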

**9. Which of the following statements about Naive Bayes is incorrect?**

- Attributes are equally important.
- Attributes are statistically dependent on one another given the class value.
- Attributes are statistically independent of one another given the class value.
- Attributes can be nominal or numeric

Attributes are statistically dependent on one another given the class value.

**10. Which of the following is not supervised learning?**

- Clustering
- Decision Tree
- Linear Regression
- Naive Bayesian

Clustering

**11. How many terms are required for building a Bayes model?**

- 1
- 2
- 3
- 4

3
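
One common reading of this answer is that Bayes' rule needs one conditional probability (the likelihood) and two unconditional probabilities (the prior and the evidence). The sketch below assumes that reading; the function name `posterior` and the numbers are hypothetical.

```python
def posterior(likelihood: float, prior: float, evidence: float) -> float:
    """Bayes' rule: P(H | E) = P(E | H) * P(H) / P(E)."""
    if evidence <= 0:
        raise ValueError("evidence must be positive")
    return likelihood * prior / evidence

# Hypothetical values: P(E | H) = 0.9, P(H) = 0.2, P(E) = 0.3
p = posterior(likelihood=0.9, prior=0.2, evidence=0.3)
print(round(p, 2))
```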

**12. Where can Bayes' rule be used?**

- Solving queries
- Increasing complexity
- Decreasing complexity
- Answering probabilistic query

Answering probabilistic query

**13. How can a Bayesian network be used to answer any query?**

- Full distribution
- Joint distribution
- Partial distribution
- All of the above

Joint distribution
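
To see why the joint distribution is enough, note that any conditional query can be answered by summing joint entries. The sketch below assumes a tiny hypothetical joint table over two binary variables; the variable names and probabilities are made up for illustration.

```python
# Hypothetical joint distribution over (rain, wet_grass).
joint = {
    (True, True): 0.27,
    (True, False): 0.03,
    (False, True): 0.14,
    (False, False): 0.56,
}

def prob(condition) -> float:
    """Sum the joint probabilities of every assignment that satisfies condition."""
    return sum(p for assignment, p in joint.items() if condition(assignment))

# Query: P(rain | wet_grass) = P(rain, wet_grass) / P(wet_grass)
p_wet = prob(lambda a: a[1])
p_rain_and_wet = prob(lambda a: a[0] and a[1])
p_rain_given_wet = p_rain_and_wet / p_wet
print(round(p_rain_given_wet, 3))
```

A Bayesian network stores this joint distribution in factored form, so in practice the sums run over products of local conditional probabilities rather than an explicit table.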

**14. What is the relationship between a node and its predecessors when constructing a Bayesian network?**

- Functionally dependent
- Dependent
- Conditionally independent
- Both conditionally dependent and dependent

Conditionally independent

**15. A Bayesian classifier is**

- A class of learning algorithms that try to find an optimum classification of a set of examples using probability theory.
- Any mechanism employed by a learning system to constrain the search space of a hypothesis
- An approach to the design of learning algorithms that is inspired by the fact that when people encounter new situations, they often explain them by reference to familiar experiences, adapting the explanations to fit the new situation.
- None of these

A class of learning algorithms that try to find an optimum classification of a set of examples using probability theory.

**16. Bias is**

- A class of learning algorithms that try to find an optimum classification of a set of examples using probability theory
- Any mechanism employed by a learning system to constrain the search space of a hypothesis
- An approach to the design of learning algorithms that is inspired by the fact that when people encounter new situations, they often explain them by reference to familiar experiences, adapting the explanations to fit the new situation.
- None of these

Any mechanism employed by a learning system to constrain the search space of a hypothesis

**17. Background knowledge refers to**

- Additional knowledge used by a learning algorithm to facilitate the learning process
- A neural network that makes use of a hidden layer
- It is a form of automatic learning.
- None of these

Additional knowledge used by a learning algorithm to facilitate the learning process

**18. Discriminating between spam and ham e-mails is a classification task**

- TRUE
- FALSE

TRUE

**19. Which of the following is not involved in data mining?**

- Knowledge extraction
- Data archaeology
- Data exploration
- Data transformation

Data transformation

**20. Naive prediction is**

- A class of learning algorithms that try to derive a Prolog program from examples
- A table with n independent attributes can be seen as an n-dimensional space.
- A prediction made using an extremely simple method, such as always predicting the same output.
- None of these

A prediction made using an extremely simple method, such as always predicting the same output.
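
A minimal sketch of one such naive predictor is a majority-class baseline that ignores its input and always returns the most common training label. The helper name `majority_baseline` and the labels are hypothetical.

```python
from collections import Counter

def majority_baseline(train_labels):
    """Return a predictor that always outputs the most common training label."""
    most_common_label, _ = Counter(train_labels).most_common(1)[0]
    # The features argument is accepted but deliberately ignored.
    return lambda _features=None: most_common_label

# Hypothetical training labels
predict = majority_baseline(["ham", "ham", "spam", "ham"])
print(predict({"subject": "hello"}))
```

Baselines like this are mainly useful as a reference point: a real model should do better than always predicting the same output.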

**21. A node is ____**

- A component of a network
- In the context of KDD and data mining, this refers to random errors in a database table.
- One of the defining aspects of a data warehouse
- None of these

A component of a network

**22. Prediction is**

- The result of the application of a theory or a rule in a specific case
- One of several possible entries within a database table that is chosen by the designer as the primary means of accessing the data in the table.
- Discipline in statistics that studies ways to find the most interesting projections of multi-dimensional spaces.
- None of these

The result of the application of a theory or a rule in a specific case

**23. What is the relation between the distance between clusters and the corresponding class discriminability?**

- proportional
- inversely-proportional
- no-relation
- None of these

proportional

**24. The classification method in which the upper limit of one class interval is the same as the lower limit of the next class interval is called the**

- exclusive method
- inclusive method
- mid point method
- None of these

exclusive method

**25. If the largest value is 60, the smallest value is 40, and the number of classes is 5, then the class interval is**

- 20
- 25
- 4
- 15

4
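
The arithmetic behind this answer is the range divided by the number of classes. A minimal sketch, with `class_width` as a hypothetical helper name:

```python
def class_width(largest: float, smallest: float, num_classes: int) -> float:
    """Class interval width = (largest - smallest) / number of classes."""
    return (largest - smallest) / num_classes

print(class_width(60, 40, 5))  # (60 - 40) / 5
```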

**26. A summary and presentation of data in tabular form with several non-overlapping classes is referred to as a**

- nominal distribution
- frequency distribution
- ordinal distribution
- None of these

frequency distribution
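
As a rough sketch, a frequency distribution can be built by counting how many values fall into each class. The example assumes the exclusive method from question 24 (each class covers lower <= x < upper); the data values and the helper name `frequency_distribution` are hypothetical.

```python
def frequency_distribution(values, lower, width, num_classes):
    """Count values per class using the exclusive method: lower <= x < upper."""
    counts = [0] * num_classes
    for x in values:
        index = int((x - lower) // width)
        # Values outside the covered range are skipped.
        if 0 <= index < num_classes:
            counts[index] += 1
    return [
        (lower + i * width, lower + (i + 1) * width, counts[i])
        for i in range(num_classes)
    ]

# Hypothetical data; classes 40-44, 44-48, 48-52, 52-56, 56-60
data = [40, 41, 44, 47, 48, 52, 55, 56, 59]
for lo, hi, count in frequency_distribution(data, lower=40, width=4, num_classes=5):
    print(f"{lo}-{hi}: {count}")
```

Because the upper limit is excluded, a value of exactly 60 would fall outside the last class here; handling it would need either an extra class or the inclusive method.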

**27. The classification method in which both the upper and lower limits of an interval are included in the class interval itself is called the**

- exclusive method
- inclusive method
- mid point method
- None of these

inclusive method

**28. Suppose there are 25 base classifiers, each with an error rate of e = 0.35. If you combine them by averaging (majority vote), what is the probability that the ensemble of these 25 classifiers makes a wrong prediction? Note: all classifiers are independent of each other.**

- 0.05
- 0.06
- 0.07
- 0.08

0.06
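
This value can be checked numerically. The sketch below assumes that "averaging" amounts to a majority vote, so the ensemble is wrong only when at least 13 of the 25 independent classifiers are wrong; the error probability is then a binomial tail sum. The function name `ensemble_error` is hypothetical.

```python
from math import comb

def ensemble_error(n: int, e: float) -> float:
    """Probability that a majority of n independent classifiers are wrong."""
    majority = n // 2 + 1  # 13 when n = 25
    return sum(
        comb(n, k) * e**k * (1 - e) ** (n - k)
        for k in range(majority, n + 1)
    )

print(round(ensemble_error(25, 0.35), 2))
```

The improvement over a single classifier depends on two conditions: each error rate must be below 0.5, and the errors must be independent.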

**29. The most widely used metrics and tools to assess a classification model are:**

- Confusion matrix
- Cost-sensitive accuracy
- Area under the ROC curve
- All of the above

All of the above
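
For reference, a confusion matrix for a binary task is just four counts. The sketch below computes them by hand and derives accuracy; the labels are hypothetical, and the helper is an illustration rather than any particular library's API.

```python
def confusion_matrix(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) counts for a binary classification task."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn

# Hypothetical ground truth and predictions
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 0, 1, 0, 1, 0, 0, 1]
tp, fp, fn, tn = confusion_matrix(y_true, y_pred)
accuracy = (tp + tn) / len(y_true)
print(tp, fp, fn, tn, accuracy)
```

Cost-sensitive accuracy weights these four cells differently, and the ROC curve traces the true-positive rate against the false-positive rate as the decision threshold varies.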

**30. When performing regression or classification, which of the following is the correct way to preprocess the data?**

- Normalize the data → PCA → training
- PCA → normalize PCA output → training
- Normalize the data → PCA → normalize PCA output → training
- None of these

Normalize the data → PCA → training

**31. Which of the following is true about Naive Bayes?**

- Assumes that all the features in a dataset are equally important
- Assumes that all the features in a dataset are independent
- Both a and b
- None of these

Both a and b

**32. In which of the following cases will K-means clustering fail to give good results? (1) Data points with outliers, (2) data points with different densities, (3) data points with nonconvex shapes.**

- 1 and 2
- 2 and 3
- 1, 2, and 3
- 1 and 3

1, 2, and 3
