Comparison
In this section, we compare the performance of our machine learning models with results
reported in previous research. The goal is to assess how well our method performs, particularly
when trained on a combination of a prior dataset and actual university exam questions.
● Performance of our proposed models trained on 2,000 and 4,000 samples,
● Results from other referenced models, wherever accuracy is reported.
Model         Accuracy (%)
LSTM          71.00
BERT          88.50
RoBERTa       88.20
DistilBERT    88.50
TextCNN       89.75

BERT          87.57
RoBERTa       87.50
DistilBERT    86.30
TextCNN       81.99
Given the realistic and diverse nature of our dataset, the table shows that our
model-ensembling method is competitive. While the NCERT dataset yielded the highest accuracy
(94.10%), owing to its scale and structure, our combined dataset of real and research-based
questions still performed robustly.
● On the 2,000-sample dataset, our ensemble model matched, and in some cases outperformed,
the results of other BERT-based and CNN models.
● Overall, our contextual models and ensemble strategy outperformed prior work that relied
on standalone methods (TF+SVM, CNN, LSTM).
This comparison highlights the value of ensemble learning and demonstrates our model's
ability to handle a variety of question types and cognitive levels in authentic assessment
settings.