Using grouping patterns
[Documentation for users]

This part answers the "why" and "how" for ignoring entries. More...

This part answers the "why" and "how" for ignoring entries.

Patterns are used to define groups for new entries; a group can be used to ignore the given entries, or to automatically set properties when the entry is taken on the entry list.

So the auto-props are assigned when the entry gets known; that happens for the add, prop-set or prop-del, and of course commit commands.
So, to override the auto-props of some new entry, just use the property commands.

Overview

When FSVS goes through your working copy it tries to find new (ie. not yet versioned) entries. Every new entry (and only new entries) gets tested (in the given order!) against the defined grouping patterns; if a pattern matches, the corresponding group is assigned to the entry, and no further matching is done.

See also entry statii.

Predefined group 1: "ignore"

If an entry gets a group named "ignore" assigned, it will not be considered for versioning.

This is the only really special group name.

Predefined group 1: "take"

This group mostly specifies that no further matching is to be done, so that later ignore patterns are not tested.

Basically the "take" group is an ordinary group like all others; it is just predefined, and available with a short-hand notation.

Why should I ignore files?

Ignore patterns are used to ignore certain directory entries, where versioning makes no sense to the user. If you're versioning the complete installation of a machine, you wouldn't care to store the contents of /proc (see man 5 proc), or possibly because of security reasons you don't want /etc/shadow , /etc/sshd/ssh_host_*key , and/or other password-containing files.

Ignore patterns allow you to define which directory entries (files, subdirectories, devices, symlinks etc.) should be taken respectively ignored.

Why should I assign groups?

The grouping patterns can be compared with the auto-props feature of subversion; it allows automatically defining properties for new entries, or ignoring them, depending on various criteria.

For example you might want to use encryption for the files in your users' .ssh directory, to secure them against unauthorized access in the repository, and completely ignore the private key files:

Grouping patterns:

    group:ignore,/home/*/.ssh/id*
    group:encrypt,/home/*/.ssh/**
And the $FSVS_CONF/groups/encrypt file would have a definition for the fsvs:commit-pipe (see here).

Syntax of group files

A group definition file looks like this:

An example:

   # This is a comment
     # This is another

   auto-props    fsvs:commit-pipe    gpg -er admin@my.net

   # End of definition

Specification of groups and patterns

While an ignore pattern just needs the pattern itself (in one of the formats below), there are some modifiers that can be additionally specified:
   [group:{name},][dir-only,][insens|nocase,][take,][mode:A:C,]pattern
These are listed in the section Modifiers below.

These kinds of ignore patterns are available:

Shell-like patterns

These must start with ./, just like a base-directory-relative path. ? , * as well as character classes [a-z] have their usual meaning, and ** is a wildcard for directory levels.

You can use a backslash \ outside of character classes to match usually special characters literally, eg. \* within a pattern will match a literal asterisk character within a file or directory name. Within character classes all characters except ] are treated literally. If a literal ] should be included in a character class, it can be placed as the first character or also be escaped using a backslash.

Example for / as the base-directory

     ./[oa]pt
     ./sys
     ./proc/*
     ./home/**~

This would ignore files and directories called apt or opt in the root directory (and files below, in the case of a directory), the directory /sys and everything below, the contents of /proc (but take the directory itself, so that upon restore it gets created as a mountpoint), and all entries matching *~ in and below /home .

Note:
The patterns are anchored at the beginning and the end. So a pattern ./sys will match only a file or directory named sys. If you want to exclude a directories' files, but not the directory itself, use something like ./dir/* or ./dir/**
If you're deep within your working copy and you'd like to ignore some files with a WC-relative ignore pattern, you might like to use the rign command.

Absolute shell patterns

There's another way to specify shell patterns - using absolute paths. The syntax is similar to normal shell patterns; but instead of the ./ prefix the full path, starting with /, is used.

         /etc/**.dpkg-old
         /etc/**.dpkg-bak
         /**.bak
         /**~

The advantage of using full paths is that a later dump and load in another working copy (eg. when moving from versioning /etc to /) does simply work; the patterns don't have to be modified.

Internally this simply tries to remove the working copy base directory at the start of the patterns; then they are processed as usually.

If a pattern does not match the wc base, and neither has the wild-wildcard prefix /**, a warning is issued; this can be handled as usual.

PCRE-patterns

PCRE stands for Perl Compatible Regular Expressions; you can read about them with man pcre (if the manpages are installed), and/or perldoc perlre (if perldoc is installed)

These patterns have the form PCRE:{pattern} (with PCRE in uppercase, to distinguish from modifiers).

An example:

     PCRE:./home/.*~
This one achieves exactly the same as ./home/**~ .

Another example:

     PCRE:./home/[a-s]

This would match /home/anthony , /home/guest , /home/somebody and so on, but would not match /home/theodore .

Note that the pathnames start with ./ , just like above, and that the patterns are anchored at the beginning. To additionally anchor at the end you could use a $ at the end.

Ignoring all files on a device

Another form to discern what is needed and what not is possible with DEVICE:[<|<=|>|>=]major[:minor].

This takes advantage of the major and minor numbers of inodes (see man 1 stat and man 2 stat).

The rule is as follows:

This is because the mount-point (ie. the directory, where the other filesystem gets attached) should be versioned (as it's needed after restore), but all entries (and all binding mounts) should not.

The possible options <= or >= define a less-or-equal-than respective bigger-or-equal-than relationship, to ignore a set of device classes.

Examples:

     tDEVICE:3
     ./*
This patterns would define that all filesystems on IDE-devices (with major number 3) are taken , and all other files are ignored.

    DEVICE:0
This would ignore all filesystems with major number 0 - in linux these are the virtual filesystems ( proc , sysfs , devpts , etc.; see /proc/filesystems , the lines with nodev ).

Mind NFS and smb-mounts, check if you're using md , lvm and/or device-mapper !

Note: The values are parsed with strtoul() , so you can use decimal, hexadecimal (with 0x prepended) and octal (with 0 prepended) notation.

Ignoring a single file, by inode

At last, another form to ignore entries is to specify them via the device their on and their inode:
     INODE:major:minor:inode
This can be used if a file can be hardlinked to many places, but only one copy should be stored. Then one path can be marked as to take , and other instances are ignored.
Note:
That's probably a bad example. There should be a better mechanism for handling hardlinks, but that needs some help from subversion.

Modifiers

All of these patterns can have one or more of these modifiers before them, with (currently) optional "," as separators; not all combinations make sense.

"take": Take pattern

This modifier is just a short-hand for assigning the group take.

"insens" or "nocase": Case insensitive

By using this you can force the match to be case-insensitive; this can be useful if other machines use eg. samba to access files.

"dironly": Match only directories

Match directories only. This is useful if you have a directory tree in which only certain files should be taken; see below.

"mode": Match entries' mode

This expects a specification of two octal values in the form m:and_value:compare_value, like m:04:00; the following examples give only the numbers.

As an example: the file has mode 0750; a specification of

A real-world example: 0007:0000 would match all entries that have no right bits set for "others", and could be used to exclude private files (like /etc/shadow). (Alternatively, the others-read bit could be used: 0004:0000.

FSVS will give an error for invalid specifications, ie. ones that can never match; an example would be 0700:0007.

For patterns with the m (mode match) and d (dironly) modifiers the filename pattern gets optional; so you don't have to give an all-match wildcard pattern (./**) for these cases.

Examples

     t,d,./var/vmail/**
     t./var/vmail/**/.*.sieve
     ./var/vmail/**
This would take all ".*.sieve" files (or directories) below /var/vmail, in all depths, and all directories there; but no other files.

If your files are at a certain depth, and you don't want all other directories taken, too, you can specify that exactly:

     td./var/vmail/*
     td./var/vmail/*/*
     t./var/vmail/*/*/.*.sieve
     ./var/vmail/**

     m:04:0
     t,./etc/
     ./**
This would take all files from /etc, but ignoring the files that are not world-readable (other-read bit cleared).

Generated for fsvs by  doxygen 1.5.9