Shared disk file system

Shared disk file system

A shared disk file system, also known as clustered file system or SAN file system, is an enterprise storage file system which can be shared (concurrently accessed for reading and writing) by multiple computers. Such devices are usually clustered servers, which connect to the underlying block device over an external storage device. Such a device is commonly a storage area network (SAN).

Shared disk file systems are necessary because with regular file systems, if multiple instances (servers) were to attempt concurrent access to the same physical device, the data would rapidly become corrupt, because there is nothing to prevent two devices from performing a modification of the same part of the file system at the same time. Conventional file locking is no aid in this, as file locking operates above the file system level; file locking can protect files against concurrent access but offers no protection when server / operating system manipulates the file system directly. Directly means the server manipulates blocks of storage device. The files in the file system consist of these blocks.

A shared file system extends the file system concept by adding a mechanism for concurrency control. It provides each device accessing the file system with a consistent and serializable view of the file system, avoiding corruption and [unintended] data loss. Such file systems also usually employ some sort of a fencing mechanism to prevent data corruption in case of node failures.

There are different architectural approaches to a shared disk file system. Some distribute file information across all the servers in a cluster (fully distributed). Others utilize a centralized metadata server. Both achieve the same result of enabling all servers to access all the data on a shared storage device.

Examples of such file systems include:
* Veritas Cluster File System
* Xsan
* Global File System
* Oracle Cluster File System
* VMware VMFS

Comparison of shared file systems

Shared disk file systems were introduced in the early 1980s, predominantly in VAX VMS clusters. They rely on a SAN, usually based on Fibre Channel, iSCSI or InfiniBand technology.

The IBM General Parallel File System (GPFS), PolyServe storage solutions, Silicon Graphics clustered file system (CXFS), Red Hat Global File System (GFS) and TerraScale Technologies TerraFS are all SAN-based shared file systems. The architecture of these file systems mirrors that of a local disk file system. Performance for a single client is good, although concurrent behavior is limited by an architecture that is not optimized for scalability.

These systems offer failover with varying degrees of robustness. GPFS has been successful for clusters of up to a few hundred nodes.

Typically, SAN performance on Fibre Channel is reasonable, but it cannot compete with clients that use InfiniBand, Quadrics or Myricom networks with native protocols.To limit scalability issues encountered by shared disk file systems, systems such as GPFS, CxFS, GFS and PolyServe Matrix Server are often used on an I/O sub-cluster that exports NFS. Isilon Systems offers an appliance for this purpose. Each of the I/O nodes then exports the file system through NFS version 2 or 3.

For NFS version 4, such exports are more complex due to the requirement for managing shared state among the NFS servers. While the scalability of NFS is improved, layering introduces further performance degradation, and NFS failover is rarely completely transparent to applications. NFS also does not support POSIX semantics. A well-tuned Lustre cluster will normally out-perform a NFS protocol-based cluster [Cope, Jason et al. [http://www.csc.cs.colorado.edu/~tufo/pubs/tufo-2005-lci.pdf “Shared Parallel Filesystems in Heterogeneous Linux Multi-Cluster Environments”] , "Proceedings of the 6th LCI International Conference on Linux Clusters: The HPC Revolution", Chapel Hill, North Carolina, April 2005.] .

Several systems offer novel architectures to address scalability and performance issues. Ibrix offers a symmetric solution, but little is publicly known about its architecture, semantics, and scalability. Panasas offers a server hardware solution, combined with client file system software. It makes use of smart object iSCSI storage devices and a metadata server that can serve multiple file sets. Good scaling and security are achievable, even though all file locking is done by a single metadata server. The Panasas system uses TCP/IP networking. Lustre’s architecture is similar, but is an open source, software-only solution running on commodity hardware. The Lustre file system has been scaled to 25,000 clients.

ee also

* network attached storage: NAS protocols (like SMB and NFS) are similar to SAN file systems in that they allow multiple computers to access the same file system (and even file) at the same time, but unlike them, NAS require simultaneous physical access to the storage device to all computers in the cluster.
* Distributed file system, different approach
* Shared resource
*
* [http://www.furquim.org/chironfs/index.en.html ChironFS] , a network replicated filesystem in Userspace.
* List of file systems, Distributed file systems section

References


Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать реферат

Look at other dictionaries:

  • File system — For library and office filing systems, see Library classification. Further information: Filing cabinet A file system (or filesystem) is a means to organize data expected to be retained after a program terminates by providing procedures to store,… …   Wikipedia

  • Clustered file system — A clustered file system is a file system which is shared by being simultaneously mounted on multiple servers. There are several approaches to clustering, most of which do not employ a clustered file system. While many computer clusters don t use… …   Wikipedia

  • Lustre (file system) — Infobox software name = Lustre developer = Sun Microsystems latest release version = 1.6.5.1 latest release date = release date|2008|07|10 operating system = Linux genre = Shared disk file system license = GPL website = http://www.lustre.org,… …   Wikipedia

  • IBM General Parallel File System — Infobox Software name = IBM GPFS caption = developer = IBM latest release version = 3.2.1 6 latest release date = September 2008 operating system = AIX / Linux / Microsoft Windows Server 2003 R2 genre = filesystem license = Proprietary website =… …   Wikipedia

  • Global File System — In computing, the Global File System (GFS) is a shared disk file system for Linux computer clusters.GFS differs from distributed file systems (such as AFS, Coda, or InterMezzo) because it allows all nodes to have direct concurrent access to the… …   Wikipedia

  • Distributed file system — For other uses, see Distributed file system (disambiguation). In computing, a distributed file system or network file system is any file system that allows access to files from multiple hosts sharing via a computer network.[1] This makes it… …   Wikipedia

  • StorNext File System — is a shared disk file system made by Quantum Corporation. It is installed on disk arrays in a storage area network (SAN). Client systems attached to the SAN are not required to run the same operating system to access StorNext data. As of January… …   Wikipedia

  • Distributed file system (disambiguation) — A distributed file system is a file system where data is distributed across multiple nodes. Distributed file system can also refer to: Shared disk file system, a different approach, also known as cluster file system. Distributed File System… …   Wikipedia

  • Veritas File System — For other uses, see Veritas (disambiguation). VERITAS File System Full name VERITAS File System Introduced 1991 Structures Directory contents extensible hash Limits Max file size 8 EB ( …   Wikipedia

  • Shared resource — In computing, a shared resource or network share is a device or piece of information on a computer that can be remotely accessed from another computer, typically via a local area network or an enterprise Intranet, transparently as if it were a… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”