Python for Data Science Week 4 NPTEL Assignment Answers 2025

Need help with this week’s assignment? Get detailed and trusted solutions for Python for Data Science Week 4 NPTEL Assignment Answers. Our expert-curated answers help you solve your assignments faster while deepening your conceptual clarity.

✅ Subject: Python for Data Science
📅 Week: 4
🎯 Session: NPTEL 2025 July-October
🔗 Course Link: Click Here
🔍 Reliability: Verified and expert-reviewed answers
📌 Trusted By: 5000+ Students

For complete and in-depth solutions to all weekly assignments, check out 👉 NPTEL Python for Data Science Week 4 NPTEL Assignment Answers

🚀 Stay ahead in your NPTEL journey with fresh, updated solutions every week!

NPTEL Python for Data Science Week 4 Assignment Answers 2025

1. Which of the following are regression problems? Assume that appropriate data is given.

  • Predicting the house price.
  • Predicting whether it will rain or not on a given day.
  • Predicting the maximum temperature on a given day.
  • Predicting the sales of the ice-creams.
Answer : See Answers

2. Which of the following are multiclass classification problems?

  • Classifying emails as spam or not spam.
  • Classifying a person’s blood type as A, B, AB, or O.
  • Predicting the price of a second-hand car.
  • Classifying a movie genre into Drama, Comedy, Action, or Thriller.
Answer :

3. If a linear regression model achieves zero training error, can we say that all the data points lie on a straight line in the feature space?

  • Yes
  • No
Answer :

Read the information given below and answer the questions from 4 to 6:

Data Description:

An automotive service chain is launching its new grand service station this weekend. They offer to service a wide variety of cars. The current capacity of the station is to check 315 cars thoroughly per day. As an inaugural offer, they claim to freely check all cars that arrive on their launch day, and report whether they need servicing or not! Unexpectedly, they get 450 cars. The servicemen will not work longer than the working hours, but the data analysts have to! Can you save the day for the new service station? How can a data scientist save the day for them? He has been given a data set, ‘ServiceTrain.csv’ that contains some attributes of the car that can be easily measured and a conclusion that if a service is needed or not. Now for the cars they cannot check in detail, they measure those attributes and store them in ‘ServiceTest.csv

Problem Statement:

Use machine learning techniques to identify whether the cars require service or not.

Read the given datasets ‘ServiceTrain.csv’ and ‘ServiceTest.csv’ as train data and test data respectively and import all the required packages for analysis.

4. Which of the following machine learning techniques would NOT be appropriate to solve the problem given in the problem statement?

  • kNN
  • Random Forest
  • Logistic Regression
  • Linear regression
Answer :

5. After applying logistic regression, what is/are the correct observations from the resultant confusion matrix?

  • True Positive = 29, True Negative = 94
  • True Positive = 94, True Negative = 29
  • False Positive = 5, True Negative = 94
  • None of the above
Answer :

6. The logistic regression model built between the input and output variables is checked for its prediction accuracy of the test data. What is the accuracy range (in %) of the predictions made over test data?

  • 60 – 79
  • 90 – 95
  • 30 – 59
  • 80 – 89
Answer : See Answers

7. How are categorical variables preprocessed before model building?

  • Standardization
  • Dummy variables
  • Correlation
  • None of the above
Answer :

8. A regression model with the function y=80+4.5x was built to understand the impact of temperature x
on ice cream sales y. The temperature this month is 10 degrees more than the previous month. What is the predicted difference in ice cream sales?

  • 56 units
  • 45 units
  • 80 units
  • None of the above
Answer :

9. X and Y are two variables that have a strong linear relationship. Which of the following statements are incorrect?

  • There cannot be a negative relationship between the two variables.
  • The relationship between the two variables is purely causal.
  • One variable may or may not cause a change in the other variable.
  • The variables can be positively or negatively correlated with each other.
Answer :

10. A multiple linear regression model is built on the Global Happiness Index dataset ‘GHI Report.csv’. What is the RMSE of the baseline model?

  • 2.00
  • 0.50
  • 1.06
  • 0.75
Answer : See Answers