Re: Q: Cluster File System Reliability



nicc777 <nicc777@xxxxxxxxx> said:
I am rather new to cluster file systems, so let me state my
requirements, and hopefully the kind people here can point me in the
right direction.

Basically I want a file system spanning multiple physical hosts, but
mounted on a server as a single mount point - almost like having some
kind of RAID over various physical nodes. The idea is that if one node
fails, the server should still be able to continue with processing
(assume it's something like a database). I am hoping that when the
failed file server(s) come back online, the "RAID" should be rebuilt
on the fly, like a real RAID. This will be first prize.

This doesn't sound like what is commonly called a "cluster file system".
A "cluster file system" (to me, at least) means shared storage that
several computers access concurrently, as if it were local storage.
That is, the actual storage media are connected to several hosts at
once, and all hosts can concurrently access the file system(s)
created on the shared media.

What you describe sounds like something I've heard Google has developed
for their search data storage - but I haven't heard of such technology
being used anywhere else.
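
To make the requested behaviour concrete, here is a toy Python sketch of
"RAID over nodes": every write goes to all hosts that are up, reads are
served by any host that is up, and a returning host is resynchronised
from a surviving copy. The names Node, MirroredStore and rejoin are
invented for illustration and do not correspond to any real product; a
real system would also have to handle locking, partial writes and
split-brain, which this sketch ignores.

```python
class Node:
    """One storage host holding a full copy of the data (hypothetical)."""

    def __init__(self, name):
        self.name = name
        self.online = True
        self.blocks = {}


class MirroredStore:
    """Toy mirror: write to every online node, read from any online node."""

    def __init__(self, nodes):
        self.nodes = nodes

    def write(self, key, data):
        live = [n for n in self.nodes if n.online]
        if not live:
            raise IOError("no storage node available")
        for n in live:
            n.blocks[key] = data

    def read(self, key):
        for n in self.nodes:
            if n.online and key in n.blocks:
                return n.blocks[key]
        raise KeyError(key)

    def rejoin(self, node):
        """Bring a failed node back and copy over whatever it missed."""
        sources = [n for n in self.nodes if n.online and n is not node]
        if not sources:
            raise IOError("no up-to-date copy to rebuild from")
        node.blocks = dict(sources[0].blocks)
        node.online = True


a, b = Node("a"), Node("b")
store = MirroredStore([a, b])
store.write("row1", b"v1")
b.online = False              # node b fails
store.write("row1", b"v2")    # the server keeps working on node a alone
store.rejoin(b)               # node b returns and is rebuilt from node a
```

The rebuild here is a full copy, which is the simplest choice; tracking
only the blocks changed during the outage would be faster but needs
extra bookkeeping.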

PS: if anybody knows something about battery backups for hard drives,
and how to protect the hard drive cache in case of power failures etc,
I would also like to hear some of your thoughts/learnings/experiences.

There are storage subsystems (disk racks) from various vendors with
various amounts of local battery-backed cache. As long as the write
to the cache completes, the disk subsystem will be in a consistent
state (and when power returns, it will flush any buffered data to
the disks).
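
The idea can be sketched in a few lines of Python. This is only a model
of the behaviour described above, not of any vendor's firmware; the
class and method names are made up for the example.

```python
class BatteryBackedCache:
    """Toy model of a disk subsystem with a battery-backed write cache."""

    def __init__(self):
        self.cache = {}      # kept alive by the battery during an outage
        self.disk = {}       # persistent media
        self.powered = True

    def write(self, block, data):
        """The write counts as done once the data is in the cache."""
        if not self.powered:
            raise IOError("subsystem is powered off")
        self.cache[block] = data

    def flush(self):
        """Copy buffered data to the disks and empty the cache."""
        self.disk.update(self.cache)
        self.cache.clear()

    def power_fail(self):
        """Mains power lost: nothing more reaches disk, cache is preserved."""
        self.powered = False

    def power_restore(self):
        """Power is back: flush whatever was buffered before the outage."""
        self.powered = True
        self.flush()


box = BatteryBackedCache()
box.write(7, b"committed row")   # acknowledged, still only in the cache
box.power_fail()
box.power_restore()              # buffered block is now written to disk
```

The point of the model is that an acknowledged write is never lost, even
though it may reach the disks only after power comes back.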
--
Wolf a.k.a. Juha Laiho Espoo, Finland
(GC 3.0) GIT d- s+: a C++ ULSH++++$ P++@ L+++ E- W+$@ N++ !K w !O !M V
PS(+) PE Y+ PGP(+) t- 5 !X R !tv b+ !DI D G e+ h---- r+++ y++++
"...cancel my subscription to the resurrection!" (Jim Morrison)