Assignment help

Join our 150К of happy users

Get original papers written according to your instructions and save time for what matters most.

Card image cap

COM7039M Machine Learning Assignment Help

 Machine Learning Assignment Brief

Contents

Module Details........................................................................................................................ 1

Assignment Description......................................................................................................... 2

Learning Outcomes................................................................................................................ 4

Advice and Guidance............................................................................................................. 4

How is this assessment marked?......................................................................................... 6

Marking Criteria....................................................................................................................... 4

 

Module Details
Module code:COM7039MLevel of Study:7
Module Leader(s):Dr. Rebecca Jeyavadhanam. BCredits:15
Assessmentformat:

Creative Artefact- A practical project to design and develop an ML prediction model with supporting

documentation

Method of submission:Turnitin within Moodle
Deadline or Assessment Period:28th Jan 2025, 12NoonFeedback date and place:19th Feb 2025, Written feedback within Turnitin/Moodle

Assessmentlimits:

length, load,word count,

etc.

N/AComponentnumber:1 of 1
Is this exempt from anonymous marking under the policy?NoComponent weighting:100%

 

 

Assignment Description

This coursework aims to demonstrate the students' comprehensive understanding and knowledge of the Machine Learning module by evaluating their analytical abilities and strengths. It comprises two tasks designed to assess and challenge their analytical skills. These tasks have been carefully crafted to test the student's proficiency in applying the concepts learnedthroughout the moduleand to showcase their ability to tackle real-world scenarios and problems. Successful completion of these tasks willreflect the students' mastery of thesubject matter andtheir capacity for critical and creative thinking.

 

The Assessment Consists of TWO Tasks:

In Task 1, you will be presented with a set of questions that require critical evaluation of your subject knowledge and understanding of the concepts related to theMachine Learning (ML)process and techniques.

 

In Task 2, you will be provided a programming exercise with a dataset to analyse usingthe Machine Learning approach. Your objective is to use suitable MachineLearning algorithms to predict and evaluate a model's accuracy. The dataset will contain various features and a target variable thatyou need to predict. You are expected to develop a model, followed by training, testing, and evaluating its performance. Your goal is to identify and highlight the best predictive model for classification. This includes comparing different models and selecting the one that offers the highest accuracy and best performance metrics for the classification task.

 

The contentof this assignment must be supported by the inclusion of pertinent academic theories, concepts, models, and contemporary industrial insights. Provide a detailed and relevant description of your code. Ensure your work is accurately cited and referenced using the York St John Harvard Referencing Style.

 

Task 01: Theory Exercise – (20 Marks)

 

  1. Discuss the K-means clustering algorithm in detail, including its working principles, advantages, disadvantages, and real-world applications. (10 Marks)

 

  1. How would you evaluate the performance of the classification algorithm with appropriate metrics? (10 Marks)

 

Task 02: Programming Exercise -– (80 Marks)

 

a. Develop a classification modelwith Machine Learning techniques to detecthate speech fromthe Twitter dataset.

 

Dataset Description:

 

Assignment Description

 

The "Hate Speech and Offensive Language" dataset is collected from Twitter. It is primarily designed to support research and development in detecting and analyzing hate speech and offensive language on social media, distinguishing them fromordinary slang andneutral content.

Dataset Link:

https://www.kaggle.com/datasets/mrmorj/hate-speech-and-offensive-language-dataset

 

Alternative Source:

The datasets are available to download fromthe Machine Learning (ML) module inthe Moodle platform.

Guidelines to Prepare Your Assignment:

 

1. Data Exploration and Pre-processing: (10 Marks)

  • Load and explore the datasetto gain insights into the data'scharacteristics.
  • Handle missing values, if any,and perform datacleansing as required.
  • Perform datavisualization to understand variables' distribution and relationships withthem.

2. Feature Engineering Process: (15 marks)

  • Extract meaningful information from the dataset usingFeature Extraction Technique
  • Reduce the number of features usingPCA as the Dimensionality Reduction Technique
  • Select themost prominent features to capture the complex relationships
  • Handle the outliers to prevent themfrom significantly affecting the model

3. ModelSelection and Training: (10 Marks)

  • Split thedataset into training and testing sets.
  • Select appropriate machine learning algorithms (e.g., logistic regression, decision trees, random forests, support vector machines, etc.) for the predictive modeling task.
  • Train the selected modelson the training data and evaluate their performance on the testing data.

4. Hyperparameter Tuning: (10 Marks)

  • Fine-tune the hyperparameters of the chosen algorithm to optimize the model'sperformance and avoidmodel overfitting.

5. ModelEvaluation: (20 Marks)

  • Compare the performance of different modelsusing appropriate evaluation metrics such as accuracy, precision, recall, F1-score, area under the receiver operating characteristic curve (AUC-ROC), Confusion matrix, and Logarithmic loss (Log Loss).
  • Comparative analysis to identify the best-performing modelfor classification.

6. Model Deployment: (10 Marks)

 

Assignment Description
  • Deploy the model to generate predictions for new and previously unseendata.

7. Conclusion and Recommendations: (5 marks)

  • Summarize the key findings of your analysis, highlighting the model'sperformance and any insights gained.

Propose potential improvements or additional steps that couldbe taken to enhance the system.

Learning Outcomes

PLOs 7.1-7.7

  1. Evaluate computer science concepts and principles and their application to the effective design, implementation, and usability of computer-based systems.
  2. Apply the findings of advanced scholarship and/or contemporary research andpractice tothe solution to computer science problems
  3. Critically evaluate computer science problems, including thoseat the forefront of the field.
  4. Demonstrate operation within applicable professional, legal, social,and ethical frameworks.
  5. Demonstrate originality and creativity in the solution of computer science problems.
  6. Recommend, with detailed justification, the appropriate computer science principles and practices to apply to significant domain-specific activity.
  7. Apply standards, qualityprocesses, and engineering principles to the solution of computer scienceproblems.
Advice and Guidance
Submission Guidelines:
Advice and Guidance
  • Prepare     a     comprehensive    report     documenting    your     approach, methodologies, results, and insights gainedfrom the project.
  • Include code snippets, visualizations, and explanations to support yourfindings. Your reportshould be clear,concise, and well-organized.
  • Submit the complete report, including both Task 1 and Task 2, as a singlePDF file. The code must be submitted as a supporting document inthe .ipynb fileformat.

 

AdditionalGuidelines for Students:

Students must submit their own work. They must acknowledge the sources used in this assignment, failure to acknowledge would be plagiarism which is an academic offence and a penaltycan be imposed. Students need to write by reading other papers on their own with citations and leave references at the end of the assignment.

Students work would be submitted to the national plagiarism facility. This identifies the sources from the internet and other extensive databases. Once the student’s work has been submitted to detection services, work is stored in databases electronically and compared their work from other sources. It is necessary to keep a backup of their work.Students’ materials wouldbe stored in the database electronically for indefinite periods.

It is essential that you acknowledge the source of any research, information, ideas, opinions, theories, or other material which is not your own. Effective referencing,  quoting,  paraphrasing,  and  summarising show  evidence  of  the reading you have doneand ensure thatyou avoid accusations of plagiarism.

The University's fundamental stance on the use of Turnitin is geared toward supporting students' academic development. You can use this linkto check your work for areas where you might be at risk of plagiarising.

Please submit your assignment on time. All assignments may be electronically submitted using Turnitin (via Moodle) by midnight on the due date. Please do not submityour assignment last minute. Pleasealso allow time for any problems or issues withsystems.

The work you present should be your own work, and not just copied from others.  You can quote from others, but you must say who the author is and use quotation marks or paraphrase.  If you do not do so, we will investigate your work for academic misconduct.  This is particularly likely if your Turnitin similarity score is above 25% and/or individual matches are above6%.


Want to boost your HND grades? Learn more about our HND Assignment Help and how our experts can guide you.

Advice and Guidance

If you requiresupport with your study skills,please visit https://www.yorksj.ac.uk/students/study-skills/

Please refer to the York St John University Code of Practice for Assessmentand Academic RelatedMatters 2024-25.

We ask that you pay particular attention to the academic misconduct policy. Penalties will be applied where a student is found guilty of academic and/or ethicalmisconduct, including termination of programme (Policy Link).

You are required to keep to the word limit set for an assessment and to notethat you may be subject to penalty if you exceedthat limit. You are required to provide an accurate word count on the cover sheet for each piece of work you submit(Policy Link).

For late or non-submission of work by the published deadline or an approved extended deadline, a mark of 0NS will be recorded. Where a re-assessment opportunity exists, a student will normally be permitted only one attempt to be re-assessed for a cappedmark (Policy Link).

An extension to the published deadline may be granted to an individual student if theymeet the eligibility criteria of the (Policy Link).

 

How is this assessment marked?

Your work will be marked according to the assessment instructions provided within thisdocument and the selected Learning Outcomes’ (LOs) (see above).

 

Furthermore, this assessment is marked usingthe assessment marking criteria or a similar rubric that aligns with the University’s Generic Assessment Descriptors (see below).1 This is to ensure all assessment decisions are comparable regardless of the discipline or mode of assessment.

 

Please note that you must meet the required baseline standards (50 – 59%) which will include the LOs and minimum expectations of the assessment. Further still, you must ensure youmeet the requirements of each gradeboundary to progress to the next,i.e., you should demonstrate your learning through the standards of the Pass, Merit and Distinction to reach a Distinction (70 – 84%). These standards are designed to scaffold and build your learning to achieve yourfullest potential in each criterion being assessed.

Deliverables for Task 1 and Task 2

 

 

 

 

 

Task 01- 20

Marks

 DeliverablesMarks
a

An extraordinary conceptual understanding of K-meansclustering algorithm, advantages, and disadvantages with real-world applicationsIf anyexamples are

provided.

10

Marks

b

An appropriate description of all the metrics like accuracy, precision, recall, F1-score, area under the receiver operating characteristic curve (AUC-ROC), Confusion matrix, and Logarithmic

loss (Log Loss.)

10

Marks

 Data Exploration and Pre-processing

Correct handling of missing values,

outliers,    and   data    normalization.

10

Marks

  Effective exploratory data analysis 
  (EDA) to    gain   insights into   the 
  dataset. Providea well-documented 
Task 02- 

Jupyter       Notebook       or       scriptcontaining    the     code     for     data

preprocessing steps.Ensure that

 

80

Marks

 each step is properly commented on to explainits purpose and functionality. 
 

Feature

Engineering

Present      code      segments      that

generate newfeatures based on

15

Marks

  domain     knowledge    or     creative 
  insights.    Discuss    the    encoding 
  methods chosenand their suitability 
  for        the        problem.        Applying 
  mathematical     transformations     to 
  numerical         features.         Scaling 
  numerical features to ensure they 
  are on similar scales.Discuss the 
  feature selection and its importance. 
 Model Selection and Training

Correct implementation of selected

machine      learning      algorithms

10

Marks

  should be presented. Splitting the 
  dataset intotraining and testing 
  (validation) setsusing techniques 
  like the train-test splitor k-fold 
  cross-validation. Adequate use of 
  libraries and tools to streamline the 
  implementationprocess. 

 

  Explanation of the methodyou used10
Hyperparameter Tuningfor hyperparameter tuning reasons for selecting this method and how itMarks
 suits your specific problem. Data 
 Splitting Strategy: How you divided 
 your data into training, validation, 
 and test sets.Model Training and 
 Evaluation Protocol: Description of 
 how   you   trained   and   evaluated 
 models for different hyperparameter 
 configurations. Explanation of the 
 performance metric(s) you used to 
 assess model performance. 
Model EvaluationAccurate     evaluation     of     model20
 performance usingrelevant metricsMarks
 (e.g., accuracy, precision, recall, F1- 
 score). 
 Comprehensive      comparison      of 
 multiple       models        and        their 
 strengths/limitations.           Insightful 
 interpretation of results and trends 
 observed. 
 Model DeploymentTesting the developed model using real-world data or unseendata that it performs as expected and provides accurate predictions and validates the model performance.

10

Marks

 Conclusion and RecommendationsThe clear and organized structure ofthe report with proper sections (Introduction, Methodology, Results, Discussion, Conclusion). Coherent explanations of the implemented algorithms and techniques. Effective visualization of results throughgraphs, charts, and tables. Cohesive and well-written analysis of findings and conclusions.

5

Marks

  Total Marks

100

Marks

 

 

Marking Criteria

Pass Grade Bands (100 – 50) (Learning Outcomes must be met)

Fail Grade Bands(49 – 0) (Learning Outcomes are not met)

 

Assessment Criteria

Pass

(50 – 59)

Merit

(60 – 69)

Distinction

(70 – 84)

Distinction

(85 – 100)

Borderline Fail

(45 - 49)

(Credits may be compensated)

Fail

(30 - 44)

(Creditsmay not becompensated)

Fail

(0 - 29)

(Credits may not be compensate

d)

Task 1a- 10%

An

extraordin ary

Demonstrates a deep and

insightful understanding of the NB algorithm with

Shows a strong

understanding with relevant examples

Provides a good

understanding with some examples and

Demonstrates

adequate understanding with

Shows limited

understanding with insufficient examples

Demonstrates a

poor understanding

Fails to

demonstrat e

 conceptuadetailed examples and criticaland solid analysis.analysis.basic examples andor analysis.with little to nounderstandi
 lanalysis.  some discussion. relevantng of
 understan ding of     examples.algorithm.
 the Naïve       
 Bayes       
 algorithm,       
 including       
 its       
 advantage       
 s,       
 disadvant       
 ages, and       
 real-world       
 applicatio       
 ns.       
Task 1b- 10%

An innovative

approach to

Exhibits exceptional creativity and originality in problem- solving.

Shows strong creativity with

effective problem- solving approaches.

Demonstrates good creativity with some innovative solutions.

Provides adequate creativity withbasic

problem-solving approaches.

Limited creativity with few innovative solutions.

Shows minimal creativity with

ineffective problem-solving.

Fails to demonstrat

e creativity or effective

 problem-      problem-
 solving      solving.
 with       
 creative       
 insights       
 and       
 solutions.       
Data Exploration andPre-processing-10%Excellent handling of missing values, outliers, and data normalization. Effective and insightful EDA with a well- documented Jupyter Notebook or script.Strong handling of data issues and effective EDA. Well- documented with minor gaps in explanation.Good handling of data issues and EDA. Adequate documentation and explanation.Adequate handling of data issues with basic EDA. Documentation and explanations are present but may lack depth.Limited handling of data issues or EDA. Incomplete documentation or explanation.Poor handling of data issues with insufficient EDA. Inadequate documentation and explanations.

Fails to handle data issueseffectively. Lacks proper EDA and documentati

on

Feature Engineering Process-15%

Innovative and effective feature engineering with detailed explanation of encoding methods, mathematical

transformations, scaling, and feature selection.

Strong feature engineering with good explanation of methods and transformations.Good feature engineering with some explanation of methods and transformations.Adequate feature engineering with basic explanation of methods and transformations.Limited feature engineering with minimalexplanation.Poor feature engineering with insufficient explanation.Fails to demonstrat e effective feature engineering.
Model Selection and Training-10%Excellent implementation of algorithms with thoughtful data splitting and optimal useof tools and libraries.

Strong implementation with appropriate data splitting and effective use of

tools.

Good implementation with correct data splitting and adequate tool usage.Adequate implementation with basic data splitting and tool usage.Limited implementation with inappropriate data splitting or tool usage.

Poor implementation with ineffective data splitting or minimal tool

usage.

Fails to implement models correctly or use tools

effectively.

Hyperparameter Tuning-10%Comprehensive evaluation using relevant metrics with insightful comparison and interpretation of results.Accurate evaluation with good comparison and interpretation of multiple models.Good evaluation with appropriate metrics and some comparison of models.Adequate evaluation with basic metrics and limited comparison of models.Limited evaluation with minimal use of metrics and comparison.Poor evaluation with inadequate metrics and no comparison of models.

Fails to evaluate models effectively or provide meaningful

insights.

Model Evaluation-20%Comprehensive evaluation using relevant metrics with insightful comparison and interpretation of results.Accurate evaluation with good comparison and interpretation of multiple models.Good evaluation with appropriate metrics and some comparison of models.Adequate evaluation with basic metrics and limited comparison of models.Limited evaluation with minimal use of metrics and comparison.Poor evaluation with inadequate metrics and no comparison of models.

Fails to evaluate models effectively or provide

meaningful insights.

 

Model Deployment-10%

Thorough testing of the model with real-world or unseen data,demonstrating accurate predictions and validation of performance.Effective testing with real-world or unseen data and validation of performance.Good testing with some validation of performance using unseendata.Adequate testing with basic validation of performance.Limited testing with insufficient  validation of performance.Poor testing with minimal or ineffective validation of performance.Fails to test or validate model performance effectively.
Report Structure and Clarity-5% [Communication]Clear, organized report with detailed sections, insightful explanations, and effective visualizations. Cohesive analysis and conclusions.Well-structured report with good explanations and visualizations.Clear report with adequate structure, explanations, and some visualizations.Adequate report with basic structure and analysis, though visualizations may be lacking.Limited report with unclear structure and insufficient analysis or visualizations.Poor report with minimal organization, analysis, and visualizations.

Fails to provide a coherent report or meaningful

analysis and

 


Searching for assignment help in Australia? Head over to Punjab Assignment Help for expert guidance and academic support across diverse subjects.

Send Your assignment brief

Share your assignment brief and after Checking assignment requirement expert Will share the quote

Get Quote and pay

Once quote is sent, you can make Payment through secure option after which our team will start work

Get Assignment

Our team will Deliver the work you can share If any feedback