Distributed System Essay

3059 words - 13 pages

Summary from the Papers:
Cloud computing is the latest evolution of Internet-Based Computing. Public internet spawned private corporate intranets, cloud computing is now spawning private cloud platforms. The database is the critical part of that platform. Therefore it is imperative that our cloud database be compatible with cloud computing.
Key Design principles of the cloud model:
The core design principle is dynamic scalability, or the ability to provision and decommission servers on demand. The shared-disk database architecture is ideally suited to cloud computing. It requires fewer and lower cost servers, it provides high availability, reduces ...view middle of the document...

This provides Atomicity, Consistency, Isolation and Durability. Dynamic Scalability has proven to be vexing problem for databases because most databases use Shared-nothing architecture which relies on splitting the data into separate silos of data, one per server. This Partitioning of data is a time-consuming process and decrease database performance if the partitioning is not done carefully to minimize data shipping. Each server must be run at low CPU utilization in order to be able to accommodate spikes in usage for that server’s data. So, need to buy expensive servers to handle the peaks. Each individual node of Shared-nothing Architecture owns a specific piece of data. Take out one server in a shared-nothing database and entire cluster must be shut down.
However, the Shared-Disk Database Architecture is ideal for cloud computing which eliminates the need to partition data and support elastic scalability. This architecture allows clusters of low-cost servers to use a single collection of data, typically served up by a Storage Area Network (SAN) or Network Attached Storage (NAS).
Advantage of Shared-Disk Database Architecture:
a. Fewer servers required: Shared-disk is master-master configuration rather than master-slave configuration. So, each node provides fail-over for the other nodes. This reduces the number of servers required by half.
b. Lower cost servers: It spreads the spikes across the entire cluster. As a result, each system can be run at a higher CPU utilization. One can purchase low-cost commodity server.
c. Scale-in: It enables to allocate and bill customers on the basis of instances of database being run on a multi-core machine.
d. Simplified maintenance/upgrade process: Servers can be upgraded individually while the cluster remains online.
e. High Availability
f. Reduced partitioning and tuning services
g. Reduced support costs
With the advent of high-throughput experimental technologies and of high speed internet connections, generation and transmission of large volumes of data has been automated over the last decade. Modern automated methods for measurement, collection and analysis of data in all fields of science, industry and economy are providing more and more data with drastically increasing complexity of its structure. Representing complex objects by means of simple objects like numerical feature vectors could be understood as a way to incorporate domain knowledge into the data mining process. “Data mining”, often also referred to as “Knowledge Discovery in Databases” aims at the automatic interpretation of large datasets. In order to able to identify patterns in data, a step of preprocessing of the data is almost always required. This processing has a significant impact on the runtime and on the results of the subsequent data mining algorithm. In order to manage the huge volume of such complex data, database systems are employed. Thus, databases provide and...

Other Papers Like Distributed System

Exercise 1.6 Scrooge Mcnugget Wants to Store Information (Names, Addresses, Descriptions of Embarrassing Moments, Etc.) About the Many Ducks on His Payroll. Not Surprisingly, the Volume of Data...

788 words - 4 pages of the following DBMS features Scrooge should pay for; in each case also indicate why Scrooge should (or should not) pay for that feature in the system he buys. 1. A security facility. 2. Concurrency control. 3. Crash recovery. 4. A view mechanism. 5. A query language. Answer 1.6 Answer omitted Exercise 21.15 Consider a heterogeneous distributed DBMS. 1. Define the terms multidatabase system and gateway. 2. Describe how queries that span

Software Engineering Essay

873 words - 4 pages 1. • Five advantages are Incremental growth; computing power can be added in small increments. • Reliability; if one machine crashes, the system as whole can still survive. • Speed; a distributed system may have more total computing power than a mainframe. • Open system; an open system is open to communicate with other systems. And open system that scales has an advantage over a closed and self-contained economic system. 2. Middleware in

Business Intelligence, Distibuted Data and Analytical Thinking

617 words - 3 pages Business Intelligence, Distributed PAGE \* MERGEFORMAT 3 Running head: BUSINESS INTELLIGENCE, DISTRIBUTED DATA ANDBusiness Intelligence, Distributed Data and Analytical ProcessesDon CarterGrand Canyon UniversityBusiness Intelligence, Distributed Data and Analytical ProcessesIntroduction: In today's business world many of the processes we used to perform locally are now moving into the on line environment. This shift has allowed organizations

System Fail

865 words - 4 pages catastrophic events that are costly in both monetary and emotional aspects. Faults within a system can be different in nature and can continue to wreak havoc on the system or to do nothing in some cases. Failures in distributed and centralized system can be specific in nature; however a generalized topic can describe the majority of these failures across the system. There are four types of failures in distributed systems that can affect functionality

Igeek, Inc. Releases Indelible Fs

660 words - 3 pages  working  infrastructure. Indelible FS provides a complete storage system, including traditional file system, object  storage and virtual disks. It is scalable, distributed and easy to use. Indelible FS integrates backup and data  deduplication into the basic file system removing the need for additional software for data safety. Indelible  FS is distributed, caching data where it needs to be accessed automatically. David Smith­Uchida, iGeek's

Chapter1-Thesis-Salesandinventory

1493 words - 6 pages manually the total number of magazines and newspapers distributed. Then the sales report and distribution report will be provided based on the existing records (inventory distribution record), receipt and returned scraps. However, since they are only using calculator to compute the total number of products distributed to Computerized Management and Monitoring System for Montejo News Paper Supply and Magazine Distributor I-3 their outlets and

Distributed Computing: What Is It And Can It Be Useful

1857 words - 8 pages spare CPU cycles, and sends the results back to the server. The application may run as a screen saver, or simply in the background, without impacting normal use of the computer. If the user of the client system needs to run his own applications at any time, control is immediately returned, and processing of the distributed application package ends.3 This must be essentially instantaneous, as any delay in returning control will probably be

Pest Analysis of Italian Environment

527 words - 3 pages railways (16th in the world), 484.688 km of roadways (11th in the world), and 133 airports distributed on the territory. The telephone system is modern and well developed, fax is widely used and mobile signal covers almost all the italian territory. All the information come from the website www.indexmundi.com and are accurate as of 2007

Case Study Lego

1493 words - 6 pages thousands of employees.  Using the SAP query tool, users are able to toggle and tabulate data and extract reports from the database.  Therefore the database’s role is also as a reports generator. Explain why distributed architectures are flexible A distributed architecture system is a software system in which components located on networked computers communicate and coordinate their actions by passing messages. The components interact with

Types of Information System

1384 words - 6 pages cases, part of the backbone of the overall IT infrastructure within an organization. • They are usually part of a pure mainframe system or a distributed system in which the mainframe plays the major role. Newer legacy systems may include one or more LANs and even client/server implementations. • Legacy systems were developed from the late 1950s through the 1980s for general-purpose business use in medium- to large-size companies. They were the

Lab Validation Report

5252 words - 22 pages  language.   Designed  around  horizontal  scale-­‐out  on  commodity  hardware,  MemSQL’s  two-­‐tiered  architecture  is  comprised  of   two  types  of  nodes:  aggregators  and  leaves.  Aggregator  nodes  are  cluster-­‐aware  query  routers  that  act  as  a   gateway  into  the  distributed  system.  The  only  data  that

Related Essays

Failures Of A Distributed System Essay

1151 words - 5 pages Failures of a Distributed System POS/355 July 25, 2013 Failures of a Distributed System In the words of Adam Savage from Mythbusters, “failure is always an option”. This holds true when talking about a distributed system, which is a computer network like a Wide Area Network (WAN) or a Local Area Network (LAN). Distributed systems is defined as a software system in which components located on networked computers communicate and coordinate

Assignment

1439 words - 6 pages | |ASSIGNMENT TITLE |: |City Bank Distributed Design System | |UNIT OUTCOMES COVERED |: | | | 35.1 Understand Microsoft architecture for enterprise applications

Hadoop Essay

276 words - 2 pages Rack is collection of 30-40 nodes. Collection of Rack is Cluster. Hadoop Architecture Two Components * Distributed File System * Map Reduce Engine HDFS Nodes * Name Node * Only one node per Cluster * Manages File system, Name Space and Metadata * Single point of Failure but mitigated by writing to multiple file systems * Data Node * Many per cluster * Manages blocks with data and serves

Distributed Database Essay

885 words - 4 pages Distributed database is one in which multiple database sites are linked by a communications system in such a way that the data at any site is available to users at other sites. Depending on the needs of an organization the following reasons would be why they would have the needs of distributed databases: * Improved reliability. A distributed system is more reliable than a centralized one, because processing is done at several sites, so