Solution Manual to Big Data Analytics 17CS82 VTU CBCS Question Bank
In this article, you can find the solution manual of Big Data Analytics – 17CS82 of Computer Science and Engineering
8th Semester Big Data Analytics notes Computer Science and Engineering
Solution Manual Big Data Analytics 17CS82 VTU CBCS Module 1 Question Bank
2. Write and explain the mapper and reducer scripts for the MapReduce model. (08M Nov 2020)
4. Explain the following roles in HDFS deployment with a diagram i) High Availability ii) Name Node Federation. (08 M Nov 2020)
7. What do you understand by HDFS? Explain the components with a neat diagram. (10 M July 2019)
8. Bring out the concept of HDFS block replication with an example. (6 M July 2019)
9. With an example explain the different general HDFS commands.
11. What is HDFS? List the components of HDFS and explain any four of them. (9 M Jan 2020)
Big Data Analytics Module 2 Question Bank with Answers
2. Explain the features and benefits of apache HIVE in Hadoop. (08 M Nov 2020)
5. Explain the Apache Sqoop export and import method with a neat diagram. (10 M July 2019)
6. Explain with a neat diagram, the Apache Oozie workflow for Hadoop architecture. (6 M July 2019)
7. How do you run map Reduce and Message Passing Interface (MPI) on YARN architecture? (10 M July 2019)
8. What do you understand by YARN distributed shell? (6 M July 2019)
9. Describe with a neat diagram, the two-step Sqoop data export and import method. (8 M Jan 2020)
10. With neat diagram discusses the various frameworks that run under YARN. (8 M Jan 2020)
11. Discuss the different views supported by Apache Ambari. (6 M Jan 2020)
12. Discuss the various features of Hadoop YARN administration. (4 M Jan 2020)
13. Explain the different HDFS administration features. (6 M Jan 2020)
Big Data Analytics Module 3 Question Bank with Answers
1. Drawflow diagram of BIDM. Explain strategic and operational decisions. (8 M Nov 2020)
2. Write any four is Business Intelligence (BI) applications for various sectors. (8 M July 2019)
3. Explain the star schema of design of Data Warehousing with an example. (6 M July 2019)
6. What is a confusion matrix? Explain. (2 M July 2019)
7. Explain with diagram CRISP-DM data mining cycle. (8 M July 2019, Jan 2020, Nov 2020)
9. Differentiate between Data Mining and Data Warehousing. (3 M July 2019)
11. With a neat diagram explain data warehouse architecture. (6 M Jan 2020)
12. Describe the common data mining mistakes (4 M Jan 2020)
13. Describe the common data mining myths.
14. List and describe the various charts use for data visualization. (4 M Jan 2020)
15. What is data mining? Explain steps in data cleaning and preparation.
Big Data Analytics Module 4 Question Bank with Answers
1. What is a splitting variable? Describe three criteria for choosing a splitting variable. (4 M July 2019)
2. List the advantages and disadvantages of a regression model. (4 M July 2019)
Outlook | Temp | Humidity | Windy | Play |
Sunny | Hot | High | False | No |
Sunny | Hot | High | True | No |
Overcast | Hot | High | False | Yes |
Rainy | Mild | High | False | Yes |
Rainy | Cool | Normal | False | Yes |
Rainy | Cool | Normal | True | No |
Overcast | Cool | Normal | True | Yes |
Sunny | Mild | High | False | No |
Sunny | Cool | Normal | False | Yes |
Rainy | Mild | Normal | False | Yes |
Sunny | Mild | Normal | True | Yes |
Overcast | Mild | High | True | Yes |
Overcast | Hot | Normal | False | Yes |
Rainy | Mild | High | True | No |
Outlook | Temp | Humidity | Windy | Play |
Sunny | Hot | Normal | True | ? |
Age | Job | House | Credit | Loan Approved |
Young | False | No | Fair | No |
Young | False | No | Good | No |
Young | True | No | Good | Yes |
Young | True | Yes | Fair | Yes |
Young | False | No | Fair | No |
Middle | False | No | Fair | No |
Middle | False | No | Good | No |
Middle | True | Yes | Good | Yes |
Middle | False | Yes | Excellent | Yes |
Middle | False | Yes | Excellent | Yes |
Old | False | Yes | Excellent | Yes |
Old | False | Yes | Good | Yes |
Old | True | No | Good | Yes |
Old | True | No | Excellent | Yes |
Old | False | No | Fair | No |
Age | Job | House | Credit | Loan Approved |
Young | False | No | Good | ? |
5. Explain the design principles of an artificial neural network. (8 M July 2019)
TID | List of Items IDs |
T100 | I1, I2, I5 |
T200 | I2, I4 |
T300 | I2, I3 |
T400 | I1, I2, I4 |
T500 | I1, I3 |
T600 | I2, I3 |
T700 | I1, I3 |
T800 | I1, I2, I3, I5 |
T900 | I1, I2, I3 |
8. Describe the advantages and disadvantages of a regression model. (8 M Jan 2020)
9. Write the different steps involved in developing an artificial neural network. (5 M Jan 2020)
10. Describe the advantages of using ANN. (3 M Jan 2020)
1 | Milk | Egg | Bread | Butter |
2 | Milk | Butter | Egg | Ketchup |
3 | Bread | Butter | Ketchup | |
4 | Milk | Bread | Butter | |
5 | Bread | Butter | Cookies | |
6 | Milk | Bread | Butter | Cookies |
7 | Milk | Cookies | ||
8 | Milk | Bread | Butter | |
9 | Bread | Butter | Egg | Cookies |
10 | Milk | Butter | Bread | |
11 | Milk | Bread | Butter | |
12 | Milk | Bread | Cookies | Ketchup |
City Size | Avg. Income | Local Investors | LOHAS Awareness | Decision |
Big | High | Yes | High | Yes |
Medium | Med | No | Med | No |
Small | Low | Yes | Low | No |
Big | High | No | High | Yes |
Small | Med | Yes | High | No |
Med | High | Yes | Med | Yes |
Med | Med | Yes | Med | No |
Big | Med | No | Med | No |
Med | High | Yes | Low | No |
Small | High | No | High | Yes |
Small | Med | No | High | No |
Med | Heigh | No | Med | No |
City Size | Avg. Income | Local Investors | LOHAS Awareness | Decision |
Med | Med | No | Med | ? |
Big Data Analytics Module 5 Question Bank with Answers
1. Compare text mining with data mining. (8 M Nov 2020)
2. What is Naïve Bayes technique? Explain its model. (5 M July 2019)
3. Explain steps in the text mining process and architecture (8 M Nov 2020)
4. What is a support vector machine? Explain its model. (8 M July 2019)
5. Mention the 3-step process of Text Mining. (3 M July 2019)
6. Explain briefly the three different types of web mining. (6 M July 2019)
7. Compute the rank values for the nodes of the following network shown in below fig. Which is the highest-ranked node? Solve the same with eight iterations. (8 M July 2019, Nov 2020)
8. Describe the difference between text mining and data mining. (6 M)
9. Explain Naïve Bayes model. What are the advantages and disadvantages of the Naïve Bayes model?
10. Briefly describe the Support vector machine (SVM) technique. (4 M)
11. What are the advantages and disadvantages of Support vector machine – SVM?
Document ID | Keywords in the document | Class h |
1 | Love Happy Joy Joy Happy | Yes |
2 | Happy Love Kick Joy Happy | Yes |
3 | Love Move Joy Good | Yes |
4 | Love Happy Joy Love Pain | Yes |
5 | Joy Love Pain Kick Pain | No |
6 | Pain Pain Love kick | No |
7 | Love Pain Joy Love Kick | ? |
13. What is web mining? Explain the different types of web mining. (8 M)
15. Write a short note on Social Network Analysis (SNA). Numerical examples on Naïve Bayes Model, SVM, and SNA (Rank Calculation).
16. Suppose we have the height, weight, and T-shirt size of some customers and we need to predict the T-shirt size of a new customer given only the height and weight information we have. Data including height, weight, and T-shirt size information is shown below
Height (in cms) | 158 | 158 | 160 | 163 | 163 | 160 | 163 | 165 | 165 | 165 | 170 | 170 | 170 |
Weight (in kgs) | 58 | 59 | 60 | 60 | 61 | 64 | 64 | 61 | 62 | 65 | 63 | 64 | 68 |
T-Shirt Size | M | M | M | M | L | L | L | L | L | L | L | L | L |
Determine the T-Shirt size of a new customer with a weight of 61 kg and height of 161 cms using KNN with K=5.
8th Semester Big Data Analytics notes Computer Science and Engineering
8th Semester Computer Science and Engineering Sun Star Exam Scanner
2018 Scheme Computer Science and Engineering VTU CBCS Notes
If you like Solution Manual Big Data Analytics 17CS82 VTU CBCS, Subscribe to our YouTube channel for more videos and like the Facebook page for regular updates.