E Commerce
E Commerce
Computers can communicate with each other when they speak a common language or use a
common communication protocol. Transmission Control Protocol/Internet Protocol (TCP/IP)
is the communication network protocol used on the Internet. TCP/IP has two parts. TCP
handles the transport of data, and IP performs routing and addressing.
Data transport
The two main methods for transporting data across a network are circuit and packet
switching. Circuit switching is commonly used for voice and package switching for data.
Parts of the telephone system still operate as a circuit-switched network. Each link of a
predetermined bandwidth is dedicated to a predetermined number of users for a period of
time.
The Internet is a packet switching network. The TCP part of TCP/IP is responsible for
splitting a message from the sending computer into packets, uniquely numbering each packet,
transmitting the packets, and putting them together in the correct sequence at the receiving
computer. The major advantage of packet switching is that it permits sharing of resources
(e.g., a communication link) and makes better use of available bandwidth.
Routing
Routing is the process of determining the path a message will take from the sending to the
receiving computer. It is the responsibility of the IP part of TCP/IP for dynamically
determining the best route through the network. Because routing is dynamic, packets of the
same message may take different paths and not necessarily arrive in the sequence in which
they were sent.
Addressability
Messages can be sent from one computer to another only when every server on the Internet is
uniquely addressable. The Internet Network Information Center (InterNIC) manages the
assignment of unique IP addresses so that TCP/IP networks anywhere in the world can
communicate with each other. An IP address is a unique 32-bit number consisting of four
groups of decimal numbers in the range 0 to 255 (e.g., 128.192.73.60). IP numbers are
difficult to recall. Humans can more easily remember addresses like aussie.mgmt.uga.edu. A
Domain Name Server (DNS) converts aussie.mgmt.uga.edu to the IP address 128.192.73.60.
The exponential growth of the Internet will eventually result in a shortage of IP addresses,
and the development of next-generation IP (IPng) is underway.
HTML
HTML is a markup language , which means it marks a portion of text as referring to a
particular type of information. HTML does not specify how this is to be interpreted; this is
the function of the browser. Often the person using the browser can specify how the
information will be presented. For instance, using the preference features of your browser,
you can indicate the font and size for presenting information. As a result, you can
significantly alter the look of the page, which could have been carefully crafted by a graphic
artist to convey a particular look and feel. Thus, the you may see an image somewhat
different from what the designer intended.
Electronic publishing
Two common approaches to electronic publishing are Adobe’s portable document format
(PDF) and HTML. The differences between HTML and PDF are summarized in Exhibit 7.
HTML PDF
HTML files can be created by a wide variety PDF files are created using special software sold by
of software. Most word processors can Adobe that is more expensive than many HTML
generate HTML creator alternatives
For example, a user can request that data be analyzed to display a spreadsheet
showing all of a company's beach ball products sold in Florida in the month of
July, compare revenue figures with those for the same products in September and
then see a comparison of other product sales in Florida in the same time period.
Analysts can then perform five types of OLAP analytical operations against
these multidimensional databases:
Pivot. Analysts can gain a new view of data by rotating the data
axes of the cube.
Uses of OLAP
OLAP can be used for data mining or the discovery of previously
undiscerned relationships between data items. An OLAP database does
not need to be as large as a data warehouse, since not all transactional
data is needed for trend analysis. Using Open Database Connectivity
(ODBC), data can be imported from existing relational databases to create
a multidimensional database for OLAP.
OLAP products include IBM Cognos, Oracle OLAP and Oracle Essbase.
OLAP features are also included in tools such as Microsoft Excel and
Microsoft SQL Server's Analysis Services). OLAP products are typically
designed for multiple-user environments, with the cost of
the software based on the number of users.
●Full-Text ePUB
●Linked References
●How to Cite this Article
Management Information Systems Department, Cyprus International University,
Haspolat, Lefkoşa via Mersin, Turkey
ABSTRACT
Huge volume of structured and unstructured data which is called big data,
nowadays, provides opportunities for companies especially those that use
electronic commerce (e-commerce). The data is collected from customer’s
internal processes, vendors, markets and business environment. This paper
presents a data mining (DM) process for e-commerce including the three
common algorithms: association, clustering and prediction. It also
highlights some of the benefits of DM to e-commerce companies in terms of
merchandise planning, sale forecasting, basket analysis, customer
relationship management and market segmentation which can be achieved
with the three data mining algorithms. The main aim of this paper is to
review the application of data mining in e-commerce by focusing on
structured and unstructured data collected thorough various resources and
cloud computing services in order to justify the importance of data mining.
Moreover, this study evaluates certain challenges of data mining like spider
identification, data transformations and making data model comprehensible
to business users. Other challenges which are supporting the slow changing
dimensions of data, making the data transformation and model building
accessible to business users are also evaluated. A clear guide to e-
commerce companies sitting on huge volume of data to easily manipulate
the data for business improvement which in return will place them highly
competitive among their competitors is also provided in this paper.
Keywords:
Data Mining, Big Data, E-Commerce, Cloud Computing
1. Introduction
Data mining in e-commerce is all about integrating statistics, databases and artificial
intelligence together with some subjects to form a new idea or a new integrated
technology for the purpose of better decision making. Data mining as a whole is
believed to be a good promoter of e-commerce. Presently, applying data mining to
e-com- merce has become a hot cake among businesses [1] . Data mining in cloud
computing is the process of extracting structured information from unstructured or
semi unstructured web data sources. From business point of view, the core concept
of cloud computing is to render computing resources in form of service to the users
who need to buy whenever they are in demand [2] . The end product of data mining
creates an avenue for decision makers to be able to track their customers’
purchasing patterns, demand trends and locations, making their strategic decision
more effective for the betterment of their business. This can bring down the cost of
inventory together with other expenses and maximizing the overall profit of the
company.
With the wide availability of the Internet, 21st century companies highly utilize
online tools and technologies for various reasons. Therefore, today many companies
buy and sell through e-commerce and the need for developing e-commerce
applications by an expert who takes responsibility for running and maintaining the
services is increasing. When businesses grow, the required resources for e-
commerce maintenance may increase more than the level the enterprise can handle.
Based on that regard, data mining can be used to handle e-commerce enterprise
services and explore patterns for online customers so companies can boost sales and
the general pro- ductivity of the business [3] . However, the cost of running such
services is a challenge to almost all e-commerce companies. Therefore cloud
computing becomes a game changer in the way and manner companies transact
their businesses by offering a comprehensive scalable and flexible services over the
Internet. Cloud computing provides a new breakthrough for enterprises, offering a
service model that includes network storage, new information resource sharing, on-
demand access to information and processing mechanism. It is possible to provide
data mining software via cloud computing which gives e-commerce companies
opportunity to centralize their software management and data storage with absolute
assurance of reliability, efficiency and protected services to their users which in turn
cut their cost and increase their profit [4] .
Cloud computing is a technology that has to do with accessing products and services
in the cloud without shouldering the burden of hosting or delivering these services.
It can be also viewed as a “model that enhances a flexible on-demand network
access to a shared pool of configurable computing resources like networks, servers,
storage applications and services that can speedily provisioned and released with
minimal management effort or service provider interaction”. In the aspect of cloud
computing everything is considered as a service. There are three service delivery
models of cloud computing namely: Infrastructure as a Service (IaaS) which is
responsible for fundamental computing resources like, storage, processing, networks
and also some standardized services over the networks. The second is the Platform
as a Service (PaaS) which gives abstractions together with the services for
developing, testing, hosting and of course maintaining the applications in the
complex and developed environment. The third one is the Software as the Service
(SaaS). The entire application or service is delivered over the web through a
browser or via application programming interface (API). With service model the
consumers only need to focus on administering users to the system.
One of the most important applications of cloud computing is the storage capability.
Cloud storage has the capability to cluster different types of storage equipment by
employing cluster system, grid technology or distributed system in the network to
provide external data storage and access services by the use of software application.
Cloud computing in e-commerce is the idea of paying bandwidth and storage space
on the scale that depends on the usage. It is much more on the utility on-demand
basis whereby a user pays for less with pay per use models. Most e-commerce
companies welcome the idea as it eliminates the high cost of storage for large
volume of business data by keeping it in the cloud data centers. The platform also
gives opportunity to use e-commerce business applications e.g. B2B and B2C with
smaller investment. Some other advantages of cloud computing for e-commerce
include the following: cost effective, speed of operations, scalability and security of
the entire service [3] [4] .
The association between cloud computing and data mining is that cloud is used to
store the data on the servers and data mining is use to provide client server
relationship as a service and information being collected based on ethical issues like
privacy and individuality are violated [5] .
Considering the importance of data mining for today’s companies, this paper
discusses benefits and challenges of data mining for e-commerce companies.
Furthermore, it reviews the process of data mining in e-com- merce together with
the common types of database and cloud computing in the field of e-commerce.
2. Data Mining
Data mining is the process of discovering meaningful pattern and correlation by
sifting through large amounts of data stored in repositories. There are several tools
for this data generation, which include abstractions, aggregations, summarization
and characteristics of data [6] . In the past decade, data mining has change the e-
commerce business. Data mining is not specific to one type of data. Data mining can
be germane to any type of information source, however, algorithms and tactics may
differ when applied to different kind of data. The challenges presented by different
type of data varies. Data mining is being used in many form of databases like flat
file, data warehouses, object oriented databases and etc.
This paper concentrates on relational databases. Relational database consists of a
set of tables containing either values of entity attributes or values of attributes from
entity relationship. Tables have columns and rows, where columns represent
attributes and rows represent tuples. A tuple in relational table corresponds to either
an object or a relationship between objects and is identified by a set of attribute
values representing a unique key [6] . The most commonly used query language for
relational database is SQL, which allows to manipulate and retrieve data stored in
the tables. Data mining algorithms using relational database can be more versatile
than data mining algorithms specifically written for flat files. Data mining can benefit
from SQL for data selection, transformation and consolidation [7] .
There are several core techniques in data mining that are used to build data mining.
Most common techniques are as follows [8] [9] :
1) Association Rules
Association rule mining is among the most important methods of data mining. The
essence of this method is extracting interesting correlation and association among
sets of items in the transactional databases or other data pools. Association rules
are used extensively in various areas. A typical association rule has an implication of
the form A→B where A is an item set and B is an item set that contains only a single
atomic condition [10] .
2) Clustering
This is the organisation of data in classes or it refers to a collection of objects by
grouping similar objects to form more than one class of methods. Moreover,
clustering class labels are unidentified and it is up to the clustering algorithm to
discover acceptable classes. Clustering is sometimes called unsupervised
classification. The reason was classification is not dictated by given class labels.
Clustering is the process of grouping a set of physical or abstract object into classes
of similar object [10] .
3) Prediction
Prediction has attracted substantial attention given the possible consequences of
successful forecasting in a business context. There are two types of predictions. The
first one is predicting unavailable data values and the second one is as soon as
classification model is form on a training set, the class label of the object can be pre-
dicted based on the attribute values of the object. Prediction is more often referred
to the forecast of missing numerical values [10] .
he first and easier process of data mining is data preprocessing and it is actually a
step before the data mining, whereby, the data is cleaned by removing the
unwanted data that has no relation with the required analysis. Hence, the process
will boost the performance of the entire data mining process and the accuracy of the
data will also be high and the time needed for the actual mining will be minimise
reasonably. Usually this happens if company already have an existing target data
warehouse, but if not then the process will consume at least 80% of the selection,
cleaning and transformation of data termed as preprocessing [13] .
Mining pattern is the second step and it actually refers to techniques or approach
used to develop a recommendation rules, or developing a model out of a large data
set. It can also be referred as techniques or algorithms of data mining. The most
common patterns used in e-commerce are prediction, clustering and association
rules.
The purpose of third step which is pattern analysis is to verify and shade more light
on the discovered model in order to give a clear path for the startup up for applying
of the data mining result. The analysis lay much emphasis on the statistics and rules
of the pattern used, by observing them after multiple users have accessed them
[14] .
However all this has to do with how iterative the overall process is, and the
interpretation of visual information you get at each sub step. Therefore, in general
data mining process iterates from the following five basic steps, which are:
• Data selection: This step is all about identifying the kind of data to be mined, the
goals for it and the necessary tool to enable the process. At the end of it the right
input attributes and output information in order to represent the task are chosen.
• Data transformation: This step is all about organising the data based on the
requirements by removing noise, converting one type of data to another,
normalising the data if there is need to, and also defining the strategy to handle the
missing data.
• Data mining step per se: Having mined the transformed data using any of the
techniques to extract pattern of interest, the miner can also make data mining
method by performing the proceeding steps correctly.
• Result interpretation and validation: For better understanding of data and it
synthesised knowledge together with its validity span, the robustness is check by
data mining application test. The information retrieved can also be evaluated by
comparing it with the earlier expertise in the application domain.
• Incorporation of the discovered knowledge: This has to do with presenting the
result of discovered knowledge to decision maker so that it is possible to compare or
check/resolve for conflict with an earlier extracted knowledge where a new
discovered pattern can be applied [15]
Results
HP has utilized Magento Commerce to expand their online business to 5
different countries. 23% of customers decided to make a purchase online
rather than going to in-person stores, and an excellent user experience is
recorded both from online shoppers and HP Asia-Pacific employees.
2.Mainline Mens wear
concerns
Mainline is an online clothing retailer that distributes many designer brands in
fashion. As Mainline Menswear offers market presence in over 100 countries via
seven custom-built websites and an app, it’s of increasing importance to
continually deliver an excellent web browsing experience. Their main goal was to
complement their current mobile website with native app features that focused on
mobile-friendly design and functionality.
Solution
Mainline Menswear made a strategic decision to build and launch Progressive Web
Apps to provide a smooth and seamless customer experience on the web, as well as
ensure the best performance. They believed transforming the original version of the
Mainline Menswear website to a PWA would enable them to take advantage of the
fast-moving web technology, and ensure the website framework (Nuxt.js, utilizing
Vue.js) would be future-proof.
Results
With the new PWA technology, the Mainline Menswear website enjoyed
a 55% higher conversion rate, and a 243% higher revenue per session in PWA
compared to the old web.