Change history - richardlehane/siegfried GitHub Wiki
Version 1.4.3 (19/12/2015)
- measure time elapsed with -log time
- bugfix: percent encode file URIs in droid output
- bugfix: long windows directory paths (further work on bug fixed in 1.4.2); reported by Ross Spencer
- bugfix: mscfb panic; reported by Ross Spencer
- bugfix: TIFF mis-identifications due to an early halt error
Version 1.4.2 (27/11/2015)
- new -throttle flag; requested by Ross Spencer
- errors logged to stderr by default (to quieten use -log ""); requested by Ross Spencer
- mscfb update: lazy reading
- webarchive update: decode Transfer-Encoding and Content-Encoding; requested by Dragan Espenschied
- bugfix: long windows paths; reported by Ross Spencer
- bugfix: 32-bit file size overflow; reported by Ross Spencer
Version 1.4.1 (6/11/2015)
- -log replaces -debug, -slow, -unknown and -known flags (see usage above)
- highlight empty file/stream with error and warning
- negative text match overrides extension-only plain text match
Version 1.4.0 (31/10/2015)
- new MIME matcher; requested by Dragan Espenschied
- support warc continuations
- add all.json and tiff.json sets
- minor speed-up
- report less redundant basis information
- report error on empty file/stream
Version 1.3.0 (27/9/2015)
- scan within warc and arc files with -z flag; reqested by Dragan Espenschied
- quit scanning earlier on known unknowns
- don't include byte signatures where formats have container signatures (unless -doubleup flag is given); fixes a mis-identification reported by Ross Spencer
- sf -slow FILE | DIR reports slow signatures
- sf -debug output simplified
- sf -version describes signature file; requested by Michelle Lindlar
- roy -limit and -exclude now operate on text and default zip matches
- roy -nopriority re-configured to return more results
- bugfix: upgraded versions of sf panic when attempting to read old signature files; reported by Stefan
- bugfix: panic mmap'ing files over 1GB on Win32; reported by Duncan
- bugfix: reporting extensions for folders with "."; reported by Ross Spencer
Version 1.2.2 (15/8/2015)
- add -noext flag to roy to suppress extension matching; requested by Greg Lepore
- -known and -unknown flags for sf to output lists of recognised and unknown files respectively; requested by Greg Lepore
Version 1.2.1 (11/8/2015)
- support annotation of sets.json files; requested by Greg Lepore
- add warning when use -extendc without -extend
- bugfix: report container extensions in details; reported by Ross Spencer
Version 1.2.0 (31/7/2015)
- text matcher (i.e. sf README will now report a 'Plain Text File' result)
- -notext flag to suppress text matcher (roy build -notext)
- all outputs now include file last modified time
- -hash flag with choice of md5, sha1, sha256, sha512, crc (e.g. sf -hash md5 FILE)
- -droid flag to mimic droid output (sf -droid FILE)
- bugfix: detect encoding of zip filenames reported by Dragan Espenschied
- bugfix: mscfb reported by Dragan Espenschied
Version 1.1.0 (17/5/2015)
- scan within archive formats (zip, tar, gzip) with -z flag
- format sets (e.g. roy build -exclude @pdfa)
- leaner, faster signature format
- support bitmask patterns
- mirror bof patterns as eof patterns where both roy -bof and -eof limits set
- bugfix: (mscfb) reported by Pascal Aantz
- bugfix: race condition in scorer (affected tip golang)
Version 1.0.0 (22/3/2015)
- user documentation
- bugfixes (mscfb, match/wac and sf)
- QA using comparator
Version 0.8.2 (22/2/2015)
- json output
- server mode
Version 0.8.1 (1/2/2015)
- bugfix: single quote YAML output
Version 0.8.0 (26/1/2015)
- optimisations (mmap, multithread, etc.)
Version 0.7.1 (9/12/2014)
- csv output
- periodic priority checking to stop searches earlier
- range/distance/choices bugfix
Version 0.7.0 (24/11/2014)
- change to signature file format
Version 0.6.1 (21/11/2014)
- roy (r2d2 rename) signature customisation
- parse Droid signature (not just PRONOM reports)
- support extension signatures
Version 0.6.0 (11/11/2014)
- support multiple identifiers
- config package
- mscfb bugfixes
- license info in srcs (no change to license; this allows for attributing authorship for non-Richard contribs)
- default home change to "$HOME/siegfried" (no longer ".siegfried")
Version 0.5.0 (1/10/2014)
- container matching
Version 0.4.2 (23/09/2014)
- cross-compile was broken (because of use of os/user). Now doing native builds on the three platforms so the download binaries should all work now.
Version 0.4.2 (16/09/2014)
- bug in processing code caused really bad matching profile for MP3 sigs. No need to update the tool for this, but please do a sieg -update to get the latest signature file.
Version 0.4.1 (14/09/2014)
- sf command line: descriptive output in YAML, including basis for matches
- optimisations inc. initial BOF loop before main matching loop
Version 0.4 (24/08/2014)
- sf command line changes: -version and -update flags now enabled
- over-the-wire updates of signature files from www.itforarchivists.com/siegfried
Version 0.3 (19/08/2014)
- replaced ac matcher with wac matcher
- re-write of bytematcher code
- some benchmarks slower but fewer really poor edge cases (see cmd/sieg/testdata/bench_results.txt)... so a win!
- but still too slow!
Version 0.2 (26/03/2014)
- benchmarks (cmd/sieg/testdata)
- an Identifier type that controls the matching process and stops on best possible match (i.e. no longer require a full file scan for all files)
- name/extension matching
- a custom reader (pkg/core/siegreader)
- simplifications to the sieg command and signature file
- optimisations that have boosted performance (see cmd/sieg/testdata/bench_results.txt). But still too slow!
Version 0.1 (28/02/2014)
First release. Parses PRONOM signatures and performs byte matching. Bare bones CLI. Glacially slow!