diff --git a/ProgrammingAssignment_1/ProgrammingAssignment1.ipynb b/ProgrammingAssignment_1/ProgrammingAssignment1.ipynb
index a3b30a259b7665993f9c2201e971b68716041229..a29d25dec7095eefb53e2bec8b2c1241cd64745f 100644
--- a/ProgrammingAssignment_1/ProgrammingAssignment1.ipynb
+++ b/ProgrammingAssignment_1/ProgrammingAssignment1.ipynb
@@ -6,9 +6,8 @@
  "source": [
  "# *k*-Nearest Neighbor\n",
  "\n",
- "We'll implement *k*-Nearest Neighbor (*k*-NN) algorithm for this assignment. You can use data available in machine learning repositories such as [UCI Machine Learning Repository](https://archive.ics.uci.edu/ml/index.php) or a dataset related to your research. Your dataset should \n",
- "* have labels (suited for classification)\n",
- "* ideally have between 1,000 - 5,000 examples\n",
+ "We'll implement *k*-Nearest Neighbor (*k*-NN) algorithm for this assignment. We will use the **madelon** dataset as in Programming Assignment 0. \n",
+ "\n",
  "A skeleton of a general supervised learning model is provided in \"model.ipynb\". The functions that will be implemented there will be indicated in this notebook. \n",
  "\n",
@@ -37,23 +36,22 @@
  "\n",
  "| | Tasks | 478 | 878 |\n",
  "|---|----------------------------|-----|-----|\n",
- "| 1 | Implement `distance` | 10 | 10 |\n",
- "| 2 | Implement `k-NN` methods | 25 | 20 |\n",
+ "| 1 | Implement `distance` | 15 | 15 |\n",
+ "| 2 | Implement `k-NN` methods | 35 | 30 |\n",
  "| 3 | Model evaluation | 25 | 20 |\n",
- "| 4 | Learning curve | 20 | 20 |\n",
- "| 6 | ROC curve analysis | 20 | 20 |\n",
+ "| 5 | ROC curve analysis | 25 | 25 |\n",
  "\n",
  "### Mandatory for 878, bonus for 478\n",
  "\n",
  "| | Tasks | 478 | 878 |\n",
  "|---|----------------|-----|-----|\n",
- "| 5 | Optimizing *k* | 10 | 10 |\n",
+ "| 4 | Optimizing *k* | 10 | 10 |\n",
  "\n",
  "### Bonus for 478/878\n",
  "\n",
  "| | Tasks | 478 | 878 |\n",
  "|---|----------------|-----|-----|\n",
- "| 7 | Assess suitability of *k*-NN | 10 | 10 |\n",
+ "| 6 | Assess suitability of *k*-NN | 10 | 10 |\n",
  "\n",
  "Points are broken down further below in Rubric sections. The **first** score is for 478, the **second** is for 878 students. There are a total of 100 points in this assignment and extra 20 bonus points for 478 students and 10 bonus points for 878 students."
  ]