White dot for spacing only
The Dice Project


Task:File-system maps
Group:bill,gdmr,cms
Stage:1


Description

A system for naming and mounting remote filesystems and distributing 
remote filesystem information will be required.
Issues

The existing DCS system uses the am-utils automounter, with filesystem
information being distributed by hesiod maps. am-utils has support
for LDAP. 

Dependencies

Filesystem maps DICE task

Changes

Unresolved Issues

This task is being held up at the moment by 2 things :-

Decisions Made at 19/3/02 DICE meeting

The Current Situation

DCS/LCFG Machines

Essentially file system maps are implemented using the AMD-utils automounter and hessiod. There are methods for generating the maps based on scripts and data files.

Main types of filesystem maps supported currently are :-

Other filesystem types exist, but are largely obsolete. These are discussed further later in this document.

Buccleuch Place/South Bridge/Forrest Hill

All sites use Sun's automount. Buccleuch Place uses NIS+. Forrest Hill and South Bridge use NIS. There may be other types of filesystems that need to be supported for users on these sites. This is being investigated by the Filesystem Issues task, led by Bill Hewitt,but will be discussed briefly later in this document for completeness.

The DICE model

The key features of the DICE model that affect this task are :-

The DICE model for home filesystems
The DICE model for Legacy Filesystems

Once we move to unified home directories (and this can be done before the start of the DICE roll out), we will need to provide a mechanism for accessing data from secondary home directories. The proposal is that these will live under /legacy/domain/home (i.e. /home/{cogsci,dai,dcs}/home. The /legacy directory will only exist for the transition period. As this data will be fairly static, the source can be kept in (a set of) flat file(s). The name of a users home directory under /legacy will be the users legacy name.

Goals of this task

Automounter Issues

There was a choice between AMD and Autofs. Both have suport for LDAP, to varying degrees. A decision was made to go with AMD because

Autofs is currently used by LCFG machines to support removable media (cdrom,floppy). There is no reason why this shouldn't continue to be the case under DICE.

Although a decision was made not to investigate both AMD and autofs, if major problems are found with AMD, we might try to fall back to autofs. It is hoped that this will need not be the case.

Filesystem Issues

Unresolved Filesystem Issues

There are a number of unresolved issues that are preventing finalizing the list of filesystems needed by DICE. We would like to know how the following will be supported in DICE

The other issue is, are we going to allow secondary groups in DICE for data sharing ? AIAI are likely to want to continue to use the shared writable area /project and would need group support for this to work. It would also be easier to manage the transition if other share read/write areas were allowed to exist (under /legacy?) after the introduction of DICE.

Similarly, although we will be recommending downloading corpus data onto local disks via rpms for working on, are we going to allow other corpus data to be browsable on all machines ?

It is likely that there will be a need for shared areas to exist, at least for the transition period.

Which filesystems were considered

We need to come up with a definitive set of filesystems that will be needed in DICE. Once the set has been approved, changes should only be made according to pre-defined procedures.

We considered the following filesystems

Filesystems included in LCFG setup that may not be needed by DICE
Filesystems in use at legacy sites that may need to be supported under DICE

This is under consideration by the filesystem issues task. It is hoped that these may be able to be supported using cvs, the web, and by using rpms for software distribution. There may be some things that will need maps.Candidates for investigation might be

Filesystems needed for laptop support

Laptops need to be able to work both connected to a DICE network and disconnected. They need to provide home directory space for their owner on local disk, but be able to mount filesystems when connected to the network.

Which Filesystems will be needed by DICE

Functionally there are three kinds of filesystem maps that we need to provide in DICE

A minimal set of maps required by DICE is given below. These might be added to later.

Filesystem Naming

This is not a technical issue, but nonetheless we have to decide where filesystems live in DICE and how they are named. It's fairly clear that we want :-

Some thought needs to go on names and locations of mounts for other things. We need to come up with a policy for this. Particular thought ought to go into using /nfs or /amd for system mounts to allow more than one transport to be used at the same time later on. Also thought needs to go in to how to generate names for partitions that are logical, preferably guessable and unique.

LDAP and AMD Issues

This section discusses issues related to implementing filesystem maps and providing LDAP structures to support them. Firstly some fundamentals

We discussed 2 possible models for storing source information

We decided that the first option was more applicable, because the second had difficulties with unified username and legacy username (used in home directory) names being different.

We discussed two kinds of structures that would be needed for storing source information in LDAP

This model matches up quite well with Georges partition types mentioned earlier in the document.

George has done some work on a test implementation of /h and /p for an amalgamation of home directories from dcs and dai. He has scripts to generate dbm files from amd maps.

If there are other maps like /h (glueing several nfs partitions together into one) we would need to define another structure like "people" to support it.

We considered /project as another similar case, but decided that that might be best implemented as /legacy/dai/project, with, if needed, a set of links distributed by rpm to link /project/blah to /legacy/dai/project/blah. as it is in /legacy, no source would be stored in LDAP, so no new structure would be needed.

We considered how the information from LDAP would be updated by COs. Options are :-

We preferred the second case because the master file can have comments and structure that would make it easier to edit.

A pseudo file dumped from LDAP would be in a random order and difficuly to navigate. Having said that, the same is true of NIS+, used at Buccleuch Place...

Proposal and timescales

Proposed work needed
Timescales

Dependencies

Actions

Current
Completed


 : Deploy 

Mini Informatics Logo - Link to Main Informatics Page
Please contact us with any comments or corrections.
Unless explicitly stated otherwise, all material is copyright The University of Edinburgh
Spacing Line