The Distributed Computing Model Based On The Capabilities of The Internet
The Distributed Computing Model Based On The Capabilities of The Internet
Identify applicable sponsor/s here. If no sponsors, delete this text box. (sponsors)
Figure 3. Performance comparison of different computational systems.
IV. ADVANTAGES AND DISADVANTAGES OF DISTRIBUTED SYSTEMS information provided by system administrators was collected.
These are most commonly based on the number of uploaded
Distributed systems despite many advantages also have
results in a given timeframe.
disadvantages. They are completely different in contrast to
supercomputers widely adopted in the academic setting, that A simple comparison of supercomputer and a distributed
are typically organized in computing clusters. Distributed system has been shown in Table I.
systems, are not able to solve all the problems, that
supercomputers can handle. The most severe restriction is the
TABLE I. COMPARING THE CHARACTERISTICS OF A DISTRIBUTED SYSTEM AND
data transfer rate. Distributed systems need to relay on the CLASSICAL SUPERCOMPUTERS
Internet connections that are significantly slower than the
Distributed
solutions used in computing clusters (e.g. Infiniband interface). system
Supercomputer
Therefore, tasks should not require sending large chunks of
Reliability Low High
data, in fact, they should provide a long computation times on
modern CPUs with relatively small input information. Another Independence Low High
significant drawback is the calculation uncertainty. One needs Scalability High Mean
to remember, that the data are being processed on third-party
computers, so there is no guarantee that they will be done Cost Very low Very high
correctly. Moreover, there is no assurance that the computation
results will be uploaded back to the server, while the user can
at any time uninstall required software or reinstall their There are many additional components that can help to
operating system. design and build individual distributed platforms. The system
can use precompiled solutions for the middleware, that
facilitate effective communication between different
Significant advantage of the distributed systems, is their components of the system. Most commonly used middleware
relatively low operating cost. Within the Client/Server platforms are CORBA (Common Object Request Broker
architecture only the server administration needs to be handled Architecture), RMI (Remote Method Invocation) and DCOM
and that often means as little as one physical server. The server (Distributed Component Object Model).
is utilized only to assign and manage tasks that are then being
completed by other computers. Low cost of such solutions has V. POSSIBLE SERVER DAEMONS
led to high interest from amateur programmers, individual Certain services need to be offered by a distributed system
scientists and research facilities. server i.e. task generation, distributing the tasks between the
Fig. 3 presents a comparison of a few most important clients and analyzing returned results. Number of daemons, as
distributed systems with three supercomputers that are listed on well as work scheduling, do not need to be evenly distributed,
TOP500. With regards to the overall performance, distributed however most often it does not differ significantly from the
systems are often not lagging far behind the technology used structure used by BOINC.
currently by NASA or the military (that needs significant
a) Assimilator: Assimilator operates on tasks that have
financial capital to operate). In order to create the chart for
supercomputers, LINPACK performance results that measure been finished and their results are already known. This service,
speed of solving a complex system of linear equations were usually saves relevant data regarding the task to the central data
used. On the other hand, for the distributed systems, base system, and when the need arise, it can also delete
temporary data from the database.
b) Transitioner: Transitioner is responsible for analyzing regions where downloading larger amounts of data can result in
tasks’ status. This is the service that e.g. assigns tasks to other long waiting times. Due to the large physical separation, issues
computers when the initial one did not return the results in a related with Internet backbone can also be visible. The term
given timeframe. In the mean time the old task is cancelled. scalability also defines system administration and maintenance.
Moreover, when the same task is sent to a few different Despite the fact that the system is being distributed, it should
computers and different results are returned, Transitioner sends be perceived by the users as one logically consistent system.
it to additional computers to verify computation correctness.
c) Validator: Validator is one of the last daemons
handling task-related operations. This service verifies uploaded
results. Generally, this daemon also assigns points (credits) to
users that are considered an award for the computation
contribution (this is done in the BOINC).
d) Work Generator: Work Generator is a daemon
generating tasks in a fully automated manner, that are then
distributed to other computers. These tasks can be generated
considering computational capabilities of various computers
(available RAM memory, free HDD space, type of CPU). Figure 5. Main problems concerning the scalability of distributed
systems.
e) Others: There are additional daemons that can be
described in the server structure. These daemons can be
responsible for other operations (e.g. cleaning) on files and VII. SECURITY
data base records. Due to their work, despite long operating Similarly to the scalability issue, during the security
times of the system, the amount of stored data does not analysis one can denote smaller, distinct components.
increase indefinitely and waste information are not stored.
Figure 3. sa
REFERENCES
[1] G. Coulouris, J. Dollimore and T. Kindberg, Distributed Systems -
Concepts and Design. U.K.: Addison-Wesley, Fourth Edition, 2005.
[2] B. C. Neuman, Scale in Distributed Systems, Readings in Distributed
Computing Systems. IEEE Computer Society Press, 1994.
[3] B. Jacob, M. Brown, K. Fukui, N. Trivedi, Introduction to Grid
Computing. International Business Machines Corporation, U.S.A.: IBM,
International Technical Support Organization, First Edition, 2005.
[4] D. P. Anderson, BOINC: A System for Public-Resource Computing and
Storage. Proceedings of the Fifth IEEE/ACM International Workshop on
Grid
[5] Computing, 2004.
[6] C. U. Sottrup, J. G. Pedersen, Developing Distrubited Computing
Solutions: Combining Grid Computing and Public Computing. M. Sc.
Thesis, Department of Computer Science, University of Copenhagen,
2005.
[7] R. J. Al-Ali, K. Amin, G. von Laszewski, O. F. Rana, D. W. Walker, M.
Hategan, N. Zaluzec, Analysis and Provision of QoS for Distributed
Figure 10. Number of computers added during certain days
Grid Applications. Kluwer Academic Publishers, 2004.