IJRIT International Journal of Research in Information Technology, Volume 1, Issue 11, November, 2013, Pg. 333-336

International Journal of Research in Information Technology (IJRIT)

www.ijrit.com

ISSN 2001-5569

Characteristics and Problems Related to a Distributed File System Rishabh Kumar Taneja1, Sahil Sarpal2, Pulkit Arora3 1

Student, Information Technology, Maharishi Dayanand University New Delhi, Delhi, India 2 Student, Information Technology, Maharishi Dayanand University New Delhi, Delhi, India 3 Student, Information Technology, Maharishi Dayanand University New Delhi, Delhi, India 1

[email protected], [email protected], [email protected]

Abstract This paper deals with the storage of large amounts of data comparable to hundreds and thousands of terabytes which is achieved with the help of Distributive File Systems. Distributed file systems are developed to allow users to do real time collaborative work in a distributed environment. Files can be replicated and stored at physically stored locations. It leads to concurrently read and write among numerous numbers of people and has successfully met large storage needs. This paper will illustrate how these file systems are characterized and what are the issues related to them.

Keywords: Caching, Characteristics, Distributed file system, Fault tolerance, Issues.

1. Introduction Since the beginning of computing there has always been problems for the storage of large amounts of data. This problem is still present in today’s powerful CPUs to store huge quantity of data varying to thousands and millions of terabytes. Communication technology of today’s world has allowed number of machines to be aggregated into computing clusters of effectively unbounded processing power and storage capacity that can solve much larger problems than a single machine. A distributed file system is designed for sharing of files. It is a client/server based application that allows clients to access and process data stored on the server as if it were on their own computer. When a user accesses a file on the server, the server sends the user a copy of the file, which is cached on the user's computer while the data is being processed and is then returned to the server. This is the basic functioning of a distributed file system. This paper deals with the characteristics of and problems that may arise while working on a distributed file system.

Sahil Sarpal, IJRIT

333

2. Characteristics of a DFS Every distributed file system has some particular properties that help in deciding whether the file system is good or not. On the basis of these properties only one can judge whether the working of that particular DFS is efficient or not. Some of these are explained below:1.

Concurrency Transparency: An update to a file should not have effect on the correct execution of other process that is concurrently sharing a file.

2.

Replication Transparency: Files may be replicated to provide redundancy for availability and also to permit concurrent access for efficiency.

3.

Fault Tolerance: It is the design that enables a system to continue operation, possibly at a reduced level, rather than failing completely, when some part of a system fails. Every distributed file system needs to be available, reliable, safety and maintainability. Availability is defined as the property that a system is ready to be used immediately. Reliability refers to the property that a system can run continuously without failure. Safety refers to the situation that when a system temporarily fails to operate correctly, nothing catastrophic happens. And maintainability refers to how easy a failed system can be repaired.

4.

Network Transparency: Same access operation as if they are local files, that is, files on a single system.

5.

Location Independence: The file name should not be changed when the physical location f the file changes.

6.

User Mobility: User should be able to access the file from anywhere.

7.

File mobility: Moves file from one place to another in a running system.

These are a few properties that are to be present in a distributed file system in order to make it efficient and trustworthy.

3. Issues Associated With DFS Many distributed file systems undergo problems even after possessing every single property listed above. This happens because of issues that the properties listed above may not clear out completely. These problems can cause disastrous errors and are required to be taken care of before-hand. These are as follows:3.1 Naming This is related with a series of checks. These checks are:1.

The way files are named. This includes considering whether the files are location transparent or not?

2.

Is the file location visible to the user?

3.

Do the names change if the file moves?

4.

Do the names change if the user moves?

Sahil Sarpal, IJRIT

334

3.2 Caching Caching is used for increasing performance. It is to be known that where are the file blocks cached. Whether these file boxes are cached on the file server or on the clients machine or both? Understanding this is very much important. 3.3 Sharing and Coherency There are a few questions that are to be answered under this segment. These are:1.

What are the set of rules for sharing?

2.

What happens when a cached block is modified?

3.

How does a node know when its cached block is out of date?

3.4 Performance This is the most important concern while building any system. Is it performing well? The answers to the following question will definitely help figuring out performance:1.

What is the cost of remote operation?

2.

What is the cost of file sharing?

3.

What are the performance limitations, both software and hardware related?

4.

How does the system scale as the number of clients grow up?

3.5 Replication As many users may be working on the same file at the same time, hence, replication is also another aspect to keep in mind. Is it necessary to think about:1.

Multiple copies of a file in a network?

2.

If there are multiple copies, how are the updates handled?

3.

What if there is a network partition and clients work on separate copies simultaneously?

All these things are to be thought about before implementing a distributed file system. These issues are manageable and when solved makes the entire system very efficient and effective.

Sahil Sarpal, IJRIT

335

4. Conclusions DFS is designed to allow users to efficiently and securely work together in a distributed environment. In such an environment it is needless to say that sometimes users concurrently access and write the same shared file. It allows a team to work over a project at once even though they may be very apart physically. It provides a lot of mobility as files can be accessed from any remote location. All these characteristics can be properly performed only by keeping the issues that may arrive away. To conclude, we would say that a distributed file system is a very important part of technology now and has a lot of advantages. By solving the problems, they are a very useful tool and that is why they are thoroughly used a lot.

5. Acknowledgments We would like to thank the entire faculty of IT Department, Dronacharya College of Engineering, Gurgaon, Haryana for their support, encouragement and patience. Special thanks to Dr. Jitender Kumar for encouraging us to write this paper.

6. References [1] Philip S. Yeager, “A distributed File System for Distributed Conferencing System”, M.S., Thesis: Partial, Computer and Information Science and Engineering, University of Florida, Gainesville, Florida, 2003. [2] Ed Lazowska, “CSE 451: Operating Systems Spring 2005”, University of Washington, 2005. [3] Phani Vamsi Krishna Maddali, “Characteristics of distributed file system”, Georgia State University, Atlanta, Georgia. [4] http://searchcio-midmarket.techtarget.com

Sahil Sarpal, IJRIT

336

Characteristics and Problems Related to a Distributed ...

All these things are to be thought about before implementing a distributed file system. ... of IT Department, Dronacharya College of Engineering, Gurgaon, Haryana ... Computer and Information Science and Engineering, University of Florida, ...

78KB Sizes 0 Downloads 216 Views

Recommend Documents

Characteristics and Problems Related to a Distributed File System
computer. When a user accesses a file on the server, the server sends the user a copy of the file, which is cached on the user's computer while the data is being processed and is then returned to the server. This is the basic functioning of a distrib

Improved Algorithms for Orienteering and Related Problems
approximation for k-stroll and obtain a solution of length. 3OPT that visits Ω(k/ log2 k) nodes. Our algorithm for k- stroll is based on an algorithm for k-TSP for ...

Improved Algorithms for Orienteering and Related Problems
Abstract. In this paper we consider the orienteering problem in undirected and directed graphs and obtain improved approximation algorithms. The point to ...

On Distributed and Parameterized Supervisor Synthesis Problems
conference version has appeared in [7]. ... v0,w = vn and for each i ∈ [1,n], there exist ui,ui and ai,bi such that (ai,bi) ∈ I,vi−1 = uiaibiui and vi = uibiaiui. The.

On Distributed and Parameterized Supervisor Synthesis Problems
regular language has a non-empty decomposable sublanguage with respect to a fixed ..... Proof: It is clear that the supremal element L. ↑. 1 of {L1 ⊆. Σ∗.

Chapter 3 Sparse Distributed Memory and Related ...
extensively since early 1900s, its role in fine motor control has been established, and its physiology is still studied intensively (Ito ...... implementations have used standard logic circuits and memory chips (Flynn et al. 1987) and programmable ga

Improved Algorithms for Orienteering and Related Problems - Martin Pál
arise in transportation, distribution of goods, scheduling of work, etc. ..... 2When we use the k-stroll algorithm as a subroutine, we call it with .... The center.

Possible Types of Prescription Related Drug Problems and Its ...
Possible Types of Prescription Related Drug Problems ... ts Detection in Community Pharmacies in Sulaimani.pdf. Possible Types of Prescription Related Drug ...

Improved Algorithms for Orienteering and Related Problems - Martin Pál
In concurrent and independent work, Nagarajan and. Ravi [26] obtained an ..... dynamic programming, and we use our new algorithms in the large-excess ...

Vocabulary related to UNEXPLAINED PHENOMENA AND EVENTS ...
There was a problem previewing this document. Retrying... Download ... Vocabulary related to UNEXPLAINED PHENOMENA AND EVENTS.pdf. Vocabulary ...

APPENDIX I. VOCABULARY RELATED TO CRIME AND ...
APPENDIX I. VOCABULARY RELATED TO CRIME AND PUNISHMENT.pdf. APPENDIX I. VOCABULARY RELATED TO CRIME AND PUNISHMENT.pdf. Open.

Using the Internet: Skill related problems in users ... - Semantic Scholar
Jun 30, 2009 - achieve the best means for the goal desired. After the right ..... To identify individual lay-out and website design related prob- lems, subjects ...

Using the Internet: Skill related problems in users ... - Semantic Scholar
Jun 30, 2009 - Government Information Service's website in the original window, even when it was still visible ..... b Measured during information assignments (free surfing). References. Aula, A. ... Virtual Inequality: Beyond the. Digital Divide.

A distributed system architecture for a distributed ...
Advances in communications technology, development of powerful desktop workstations, and increased user demands for sophisticated applications are rapidly changing computing from a traditional centralized model to a distributed one. The tools and ser

A New Data Representation Based on Training Data Characteristics to ...
Sep 18, 2016 - sentence is processed as one sequence. The first and the second techniques are evaluated with MLP, .... rent words with the previous one to represent the influence. Thus, each current input is represented by ...... (better disting

On resource allocation problems in distributed MIMO ...
Dec 14, 2010 - Energy-efficient communications for single-user MIMO .... networks of multi-antenna terminals”, Springer Telecommunications Systems Journal, ...

Initiative: 1691, Related to Taxes
Jul 9, 2015 - to determine the total number of signatures affixed to the petition and to transmit the total to the ... (Elections Code §§ 9030(d) and (e))…

Seed dispersal and changing seed characteristics in a ... - Springer Link
from the crown seed-bank, whereas seeds stored in the soil are .... account for the larger seed densities observed in S2. .... C.A.B. International, Wallingford, UK.

Characteristics and dynamics of a regional moose Alces alces 2010 ...
located within the Androscoggin River watershed in. the town of Milan ... Characteristics and dynamics of a regional moose Alces alces 2010.pdf. Characteristics ...

Initiative: 1745, Related to Marijuana
Dec 22, 2015 - to determine the total number of signatures affixed to the petition and to transmit ... e. Last day for county to determine total number of qualified.

Anatomic and Functional Characteristics of a Slow ...
reflected by a jump in their AV nodal recovery curve. The jump reflects the .... along the PNE as well as from nearby transitional tissues. The ... resolution and a 0.47-ms precision with a computer algorithm.55. Stimulation ... Instruments) and anal

Particle Concentration and Characteristics near a Major Freeway with ...
vehicles in the Los Angeles highway network. Particle concentration measurements were accompanied by measurements of black carbon, elemental and organic carbon, and gaseous species (CO, CO2). Using the incremental increase of CO2 over the background

Anatomic and Functional Characteristics of a Slow ...
reflected by a jump in their AV nodal recovery curve. The jump reflects the shift of the .... Instruments) and analyzed with the Data-Pac II program (Run. Technologies). ...... Ho SY, McComb JM, Scott CD, Anderson RH. Morphology of the.