0% found this document useful (0 votes)
31 views

Universitas Islam Makassar Fakultas Keguruan Dan Ilmu Pendidikan Soal Mid Semester Ganjil Ta. 2019/2020 Soal Ujian Penjaminan Kualitas (Upk)

The document is an abstract for a research paper about dimensional reduction methods for text clustering. It discusses how high feature space is an issue for text clustering and various reduction methods have been introduced to select informative subfeatures. Typically, union and intersection methods are used to combine subfeatures selected by different reduction methods, but union increases dimensions while intersection loses some important features. Therefore, the research proposes a modified union approach that applies the union method to top ranking features and intersection to the rest, selecting features using Term Variance and Document Frequency methods. The effectiveness was tested on a Hadith data set, and results showed the proposed method improved clustering accuracy over other methods with a DB index of 2.7.

Uploaded by

Mega
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
31 views

Universitas Islam Makassar Fakultas Keguruan Dan Ilmu Pendidikan Soal Mid Semester Ganjil Ta. 2019/2020 Soal Ujian Penjaminan Kualitas (Upk)

The document is an abstract for a research paper about dimensional reduction methods for text clustering. It discusses how high feature space is an issue for text clustering and various reduction methods have been introduced to select informative subfeatures. Typically, union and intersection methods are used to combine subfeatures selected by different reduction methods, but union increases dimensions while intersection loses some important features. Therefore, the research proposes a modified union approach that applies the union method to top ranking features and intersection to the rest, selecting features using Term Variance and Document Frequency methods. The effectiveness was tested on a Hadith data set, and results showed the proposed method improved clustering accuracy over other methods with a DB index of 2.7.

Uploaded by

Mega
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 1

UNIVERSITAS ISLAM MAKASSAR

FAKULTAS KEGURUAN DAN ILMU PENDIDIKAN


SOAL MID SEMESTER GANJIL TA. 2019/2020
SOAL UJIAN PENJAMINAN KUALITAS (UPK)

Mata Kuliah : English-Indonesia Translation


Kelas / Semester : A / IV
Hari / Tanggal : 13 April 2020
Waktu : 90 Menit
Program Studi : Pendidikan Bahasa Inggris
Pengampu M.K : Sitti Nurjannah S.Pd., M.Pd.
Jumlah Mahasiswa : Mahasiswa

Translate the following abstract and analyse the technique and strategies that you use in translate the
abstract!

Abstract
The high feature space (dimension) is one of the main issues to be considered in the text clustering
process. Therefore, various dimensional reduction methods have been introduced for selecting
informative sub feature. Each method uses a different strategy to select sub feature, and the results are
different even if using the same data set. Typically, union methods and intersection methods are used
to combine selected sub feature with different reduction methods. The union method selects all feature
and intersection only selects the general feature under consideration. Thus, the union approach causes
an increase in feature dimensions and the intersection approach causes the loss of some important
feature. Therefore, in order to take advantage of a method and reduce its weaknesses, this research
proposes new approach, which are called modified union. This approach applies the union methods to
select top ranking feature and applies intersection methods to the rest of the feature. In this case,
feature selection uses the Term Variance (TV) and Document Frequency (DF) methods to calculate the
relevance value of each feature. The effectiveness of the proposed method is tested on the data set of
Hadith Shahih Bukhary. The results show that the proposed method improves clustering accuracy over
other methods with DB index is 2.7.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy