Data Analytics mcq with answers

Data Analytics mcq with answers | big data analytics mcq
1. A ____ is a decision support tool that uses a tree-like graph or model of decisions and their possible consequences, including chance event outcomes, resource costs, and utility.
- Decision tree
- Graphs
- Trees
- Neural Networks
Decision tree
2. What is Decision Tree?
- Flow-Chart
- Structure in which internal node represents test on an attribute, each branch represents outcome of test and each leaf node represents class label
- Flow-Chart & Structure in which internal node represents test on an attribute, each branch represents outcome of test and each leaf node represents class label
- None of Above
Flow-Chart & Structure in which internal node represents test on an attribute, each branch represents outcome of test and each leaf node represents class label
3. Decision Trees can be used for Classification Tasks.
- TRUE
- FALSE
TRUE
4. Choose from the following that are Decision Tree nodes?
- Decision Nodes
- End Nodes
- Chance Nodes
- All of Above
All of Above
5. Decision Nodes are represented by __
- Disks
- Squares
- Circles
- Triangles
Squares
6. Chance Nodes are represented by __
- Disks
- Squares
- Circles
- Triangles
Circles
7. End Nodes are represented by __
- Disks
- Squares
- Circles
- Triangles
Triangles
8. Which of the following are the advantage/s of Decision Trees?
- Possible Scenarios can be added
- Use a white box model, If given result is provided by a model
- Worst, best and expected values can be determined for different scenarios
- All of Above
All of Above
9. Which of the following statements about Naive Bayes is incorrect?
- Attributes are equally important.
- Attributes are statistically dependent of one another given the class value.
- Attributes are statistically independent of one another given the class value.
- Attributes can be nominal or numeric
Attributes are statistically dependent of one another given the class value.
10. Which of the following is not supervised learning?
- Clustering
- Decision Tree
- Linear Regression
- Naive Bayesian
Clustering
Data analytics mcq with answers
11. How many terms are required for building a bayes model?
- 1
- 2
- 3
- 4
3
12. Where does the bayes rule can be used?
- Solving queries
- Increasing complexity
- Decreasing complexity
- Answering probabilistic query
Answering probabilistic query
13. How the bayesian network can be used to answer any query?
- Full distribution
- Joint distribution
- Partial distribution
- All of Above
Joint distribution
14. What is the consequence between a node and its predecessors while creating bayesian network?
- Functionally dependent
- Dependant
- Conditionally independent
- Both Conditionally dependant & Dependant
Conditionally independent
15. Bayesian classifiers is
- A class of learning algorithm that tries to find an optimum classification of a set of examples using the probabilistic theory.
- Any mechanism employed by a learning system to constrain the search space of a hypothesis
- An approach to the design of learning algorithms that is inspired by the fact that when people encounter new situations, they often explain them by reference to familiar experiences, adapting the explanations to fit the new situation.
- None of these
A class of learning algorithm that tries to find an optimum classification of a set of examples using the probabilistic theory.
16. Bias is
- A class of learning algorithm that tries to find an optimum classification of a set of examples using the probabilistic theory
- Any mechanism employed by a learning system to constrain the search space of a hypothesis
- An approach to the design of learning algorithms that is inspired by the fact that when people encounter new situations, they often explain them by reference to familiar experiences, adapting the explanations to fit the new situation.
- None of these
Any mechanism employed by a learning system to constrain the search space of a hypothesis
17. Background knowledge referred to
- Additional acquaintance used by a learning algorithm to facilitate the learning process
- A neural network that makes use of a hidden layer
- It is a form of automatic learning.
- None of these
Additional acquaintance used by a learning algorithm to facilitate the learning process
18. Discriminating between spam and ham e-mails is a classification task
- TRUE
- FALSE
TRUE
19. which of the following is not involve in data mining?
- Knowledge extraction
- Data archaeology
- Data exploration
- Data transformation
Data transformation
20. Naive prediction is
- A class of learning algorithms that try to derive a Prolog program from examples
- A table with n independent attributes can be seen as an n- dimensional space.
- A prediction made using an extremely simple method, such as always predicting the same output.
- None of these
A prediction made using an extremely simple method, such as always predicting the same output.
data analytics mcq questions and answers
21. Node is ____
- A component of a network
- In the context of KDD and data mining, this refers to random errors in a database table.
- One of the defining aspects of a data warehouse
- None of these
A component of a network
22. Prediction is
- The result of the application of a theory or a rule in a specific case
- One of several possible enters within a database table that is chosen by the designer as the primary means of accessing the data in the table.
- Discipline in statistics that studies ways to find the most interesting projections of multi-dimensional spaces.
- None of these
The result of the application of a theory or a rule in a specific case
23. What is the relation between the distance between clusters and the corresponding class discriminability?
- proportional
- inversely-proportional
- no-relation
- None of these
proportional
24. the classification method in which the upper limit of interval is same as of lower class interval is called
- exclusive method
- inclusive method
- mid point method
- None of these
exclusive method
25. larger value is 60 and the smallest value is 40 and the number of classes is 5 then the class interval is
- 20
- 25
- 4
- 15
4
Big data analytics mcq
26. summary and presentation of data in tabular form with several non overlapping classes is referred as
- nominal distribution
- frequency distribution
- ordinal distribution
- None of these
frequency distribution
27. the classification method in which the upper and lower limit of interval is also in class interval itself is called
- exclusive method
- inclusive method
- mid point method
- None of these
inclusive method
28. Suppose there are 25 base classifiers. Each classifier has error rates of e = 0.35. Suppose you are using averaging as ensemble of above 25 classifiers will make a wrong prediction? Note: all classifiers are independent of each other
- 0.05
- 0.06
- 0.07
- 0.08
0.06
29. The most widely used metrics and tools to assess a classification model are:
- Confusion matrix
- Cost-sensitive accuracy
- Area under the ROC curve
- All of Above
All of Above
30. When performing regression or classification, which of the following is the correct way to preprocess the data?
- Normalize the data → PCA → training
- PCA → normalize PCA output → training
- Normalize the data → PCA → normalize PCA output → training
- None of these
Normalize the data → PCA → training
Data Analytics mcq sppu
31. Which of the following is true about Naive Bayes ?
- Assumes that all the features in a dataset are equally important
- Assumes that all the features in a dataset are independent
- both a and b
- None of these
both a and b
32. In which of the following cases will K-means clustering fail to give good results? 1) Data points with outliers 2) Data points with different densities 3) Data points with nonconvex shapes
- 1 and 2
- 2 and 3
- 1, 2, and 3
- 1 and 3
1, 2, and 3
Data analytics mcq sppu
data analytics mcq, data analytics mcq pdf, data analytics mcq questions and answers, data analytics mcq with answers, data analytics mcq with answers pdf, big data analytics mcq, data analytics multiple choice questions, data analytics sppu mcq, big data analytics mcq with answers, big data analytics mcq questions with answers