migration-tools.git
Galen Charlton [Tue, 13 Apr 2010 19:22:01 +0000]
revert previous commit (swept too much in)

Galen Charlton [Tue, 13 Apr 2010 19:18:52 +0000]
recover gracefully after an XML parsing error

Galen Charlton [Tue, 13 Apr 2010 00:19:06 +0000]
NAME
       xlsx2tab

SUMMARY
       Quick-and-dirty filter lifted from the Spreadsheet::XLSX POD to convert the first sheet of an Excel .xlsx file to TSV

USAGE
       xlsx2tab foo.xlsx > foo.tsv

Ben Ostrowsky [Mon, 12 Apr 2010 22:08:34 +0000]
Will leave 'unlisted' alone

Ben Ostrowsky [Mon, 12 Apr 2010 22:06:41 +0000]
Handling a few more exceptions...

Ben Ostrowsky [Mon, 12 Apr 2010 21:54:27 +0000]
Added logic to attempt_phone for handling a few more variants correctly

Ben Ostrowsky [Mon, 12 Apr 2010 21:47:43 +0000]
Added function attempt_phone (TEXT,TEXT) RETURNS TEXT.  First argument is legacy phone data, second is area code to prepend.

Ben Ostrowsky [Tue, 6 Apr 2010 14:55:30 +0000]
debugged a comment ;)

Ben Ostrowsky [Tue, 6 Apr 2010 14:50:54 +0000]
Renamed utility, added --help and perldoc documentation

Ben Ostrowsky [Tue, 6 Apr 2010 14:31:16 +0000]
Moving DBF-to-TSV utility (not Athena-specific)

Ben Ostrowsky [Tue, 6 Apr 2010 14:23:16 +0000]
First version of a utility to convert Athena DBF to TSV suitable for \copy.
Suitable includes escaping backslash, tab, newline, and carriage return,
and converting output to UTF-8.
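
The escaping rules named above can be sketched as follows. This is a minimal illustration in Python, not the utility's actual code; the function names are hypothetical, and rendering a missing value as \N is an assumption based on PostgreSQL's default NULL marker for COPY text format.

```python
def escape_copy_field(value):
    """Escape one field for PostgreSQL's COPY text format (hypothetical helper)."""
    if value is None:
        # Assumption: missing values are written as \N, COPY's default NULL marker.
        return "\\N"
    # Escape the backslash first so the escapes added below are not doubled.
    return (
        value.replace("\\", "\\\\")
        .replace("\t", "\\t")
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def to_tsv_line(fields):
    """Join escaped fields with tabs and terminate the row with a newline."""
    return "\t".join(escape_copy_field(f) for f in fields) + "\n"
```

Writing the resulting lines to a file opened with encoding="utf-8" would cover the UTF-8 conversion step.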

Galen Charlton [Wed, 17 Mar 2010 19:20:33 +0000]
normalize output by escaping backslashes

This allows psql to import the data correctly; use
the new --disable-pg-normalization option to disable
this behavior if the output is not meant to go
directly into a Pg database via copy or \copy

Galen Charlton [Mon, 15 Mar 2010 16:34:57 +0000]
fix retrieval of correct column headers and data from non-holdings tags

This work was sponsored by the Indiana State Library.

Galen Charlton [Fri, 12 Mar 2010 20:32:45 +0000]
improved handling for drawing data from fields other than the holdings tag

Suppose the 852 is the main holdings tag, but information is needed
from the 590$a (e.g., as an item note).  The extract_holdings configuration file
can now work as follows:

# to map the subfield contents to the item record
barcode 852 p
note    590 a

# to map, but have the 590$a information be associated
# only with the first item in the bib; if m:first is
# not supplied, the 590$a note will be supplied
# to all items associated with the bib
barcode 852 p
note    590 a m:first

# to put all occurrences of the 590$a into a separate
# file for linking with the appropriate items.  m:multi
# *must* be supplied if the 590 tag occurs more than once
# in the bib records
barcode 852 p
note    590 a m:multi

# to concatenate repeats of $a in the 590 together
barcode 852 p
note    590 a m:concatenate

# to concatenate with a custom separator
barcode 852 p
note    590 a m:concatenate c:' / '

Development sponsored by the Indiana State Library
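
As a rough illustration of the mapping syntax shown above, the following Python sketch parses one configuration line into its parts. It is not the extract_holdings parser; the function name and the returned dictionary layout are hypothetical, and it assumes every modifier has the form key:value.

```python
import shlex


def parse_mapping_line(line):
    """Parse one mapping line; return None for blank lines and comments."""
    stripped = line.strip()
    if not stripped or stripped.startswith("#"):
        return None
    # shlex keeps a quoted separator such as c:' / ' together as one token.
    tokens = shlex.split(stripped)
    if len(tokens) < 3:
        raise ValueError("expected: <field> <tag> <subfield> [modifiers]")
    field, tag, subfield, *mods = tokens
    modifiers = {}
    for mod in mods:
        # Assumption: modifiers look like key:value (m:first, c:' / ').
        key, _, value = mod.partition(":")
        modifiers[key] = value
    return {"field": field, "tag": tag, "subfield": subfield, "modifiers": modifiers}
```

For example, the line `note    590 a m:concatenate c:' / '` would yield the modifiers `m` set to `concatenate` and `c` set to the three-character separator.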

Galen Charlton [Mon, 8 Mar 2010 21:14:08 +0000]
fix error in Codabar checkdigit algorithm

Ben Ostrowsky [Tue, 2 Feb 2010 15:55:35 +0000]
asset.copy.price is NUMERIC(8,2), not NUMERIC(6,2).

Galen Charlton [Fri, 29 Jan 2010 15:42:45 +0000]
new function add_codabar_checkdigit

-- add_codabar_checkdigit
--   $barcode      source barcode
--
-- If the source string is 13 or 14 characters long and contains only digits, adds or replaces the 14th
-- character with a check digit computed according to the usual algorithm for library barcodes
-- using the Codabar symbology - see <http://www.makebarcode.com/specs/codabar.html>.  If the
-- input string does not meet those requirements, it is returned unchanged.
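
A sketch of the behavior described in the comment above, written in Python rather than as the stored function itself. It assumes the common 14-digit library barcode scheme: digits in odd positions (counting from the left, starting at 1) are doubled, 9 is subtracted from any product of 10 or more, and the check digit brings the total to a multiple of 10.

```python
def add_codabar_checkdigit(barcode):
    """Add or replace the 14th character with a mod-10 check digit (sketch)."""
    # Inputs that are not 13 or 14 ASCII digits are returned unchanged.
    if len(barcode) not in (13, 14) or not (barcode.isascii() and barcode.isdigit()):
        return barcode
    total = 0
    for position, char in enumerate(barcode[:13], start=1):
        digit = int(char)
        if position % 2 == 1:
            # Odd positions are doubled; products of 10 or more lose 9.
            digit *= 2
            if digit >= 10:
                digit -= 9
        total += digit
    check = (10 - total % 10) % 10
    return barcode[:13] + str(check)
```

A 13-digit input gains a 14th digit, a 14-digit input has its last digit recomputed, and anything else passes through untouched.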

Ben Ostrowsky [Thu, 28 Jan 2010 17:35:37 +0000]
Added migration_tools.attempt_money, which returns NUMERIC(6,2)

Ben Ostrowsky [Tue, 5 Jan 2010 20:17:38 +0000]
Ligonier (PA) is Polaris, yes, but Ligonier (IN) is Mandarin.

Ben Ostrowsky [Tue, 5 Jan 2010 14:30:27 +0000]
Polaris patron extractor, first working version

Ben Ostrowsky [Tue, 29 Dec 2009 15:40:01 +0000]
A tool I found online

Ben Ostrowsky [Mon, 28 Dec 2009 17:34:46 +0000]
Script now handles Jr and Sr without a trailing period, and the suffix IV.

Galen Charlton [Wed, 9 Dec 2009 13:52:47 +0000]
quell warning if no Koha bib ID found

Galen Charlton [Tue, 8 Dec 2009 20:43:43 +0000]
added koha_bib_id fingerprint option (for Koha migrations for
Indiana from INCOLSA)

Galen Charlton [Wed, 25 Nov 2009 14:27:04 +0000]
fix regex for extracting bib ID from input

Ben Ostrowsky [Tue, 10 Nov 2009 18:11:14 +0000]
Fixed another inaccuracy in the inline documentation

Ben Ostrowsky [Tue, 10 Nov 2009 15:12:24 +0000]
Made inline-documentation references to Equinox::Migration modules more explicit

Ben Ostrowsky [Fri, 30 Oct 2009 14:43:58 +0000]
Migration schemas will now handle action.hold_notification

Galen Charlton [Mon, 12 Oct 2009 14:23:33 +0000]
fixed minor errors in extract_loadset that prevent it from functioning

Jason Etheridge [Tue, 8 Sep 2009 09:00:48 +0000]
utility methods for best-effort datatype casting

Galen Charlton [Wed, 26 Aug 2009 19:00:28 +0000]
declare expand_barcode to be STABLE

As noticed by Jason, doing this gives a significant
performance boost.

Jason Etheridge [Wed, 26 Aug 2009 07:07:22 +0000]
support for dynamic mapping of circ fields

Galen Charlton [Mon, 24 Aug 2009 16:20:34 +0000]
added new function to pad barcodes

-- expand_barcode
--   $barcode      source barcode
--   $prefix       prefix to add to barcode, NULL = add no prefix
--   $maxlen       maximum length of barcode; default to 14 if left NULL
--   $pad          padding string to apply to left of source barcode before adding
--                 prefix and suffix; set to NULL or '' if no padding is desired
--   $suffix       suffix to add to barcode, NULL = add no suffix
--
-- Returns a new string consisting of prefix concatenated with padded barcode and suffix.
-- If new barcode would be longer than $maxlen, the original barcode is returned instead.

Example usage:

UPDATE foo
SET barcode = migration_tools.expand_barcode(barcode, '21234', 14, '0', '');
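
To make the parameter semantics above concrete, here is a Python sketch of the same logic. It is an illustration, not the SQL function's source. It assumes that padding fills the barcode out to exactly $maxlen once prefix and suffix are accounted for, and that a multi-character pad string is repeated and trimmed to fit.

```python
def expand_barcode(barcode, prefix=None, maxlen=None, pad=None, suffix=None):
    """Return prefix + left-padded barcode + suffix, or the original if too long."""
    prefix = prefix or ""
    suffix = suffix or ""
    pad = pad or ""          # None or '' means no padding
    maxlen = maxlen or 14    # default maximum length when left unset
    # If the result cannot fit within maxlen, return the source barcode as-is.
    if len(prefix) + len(barcode) + len(suffix) > maxlen:
        return barcode
    if pad:
        # Assumption: pad on the left until the whole result is maxlen long.
        fill = maxlen - len(prefix) - len(suffix) - len(barcode)
        barcode = (pad * fill)[:fill] + barcode
    return prefix + barcode + suffix
```

Under these assumptions, calling it with the arguments from the UPDATE statement above turns a short legacy barcode into a 14-character one that starts with 21234 and is zero-filled in the middle.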

Jason Etheridge [Wed, 12 Aug 2009 05:53:55 +0000]
syntax

Jason Etheridge [Wed, 12 Aug 2009 05:11:04 +0000]
move copy location mapping functionality into base.sql

Jason Etheridge [Wed, 12 Aug 2009 04:58:48 +0000]
move dynamic item mapping functionality to base.sql; it's not ILS specific

Jason Etheridge [Tue, 11 Aug 2009 10:39:15 +0000]
this stored procedure is a true function with no side effects

Jason Etheridge [Tue, 11 Aug 2009 10:37:09 +0000]
name parser

Jason Etheridge [Sat, 8 Aug 2009 22:52:13 +0000]
fix signature

Jason Etheridge [Sat, 8 Aug 2009 22:41:00 +0000]
start moving this mapping functionality over to base, since the only part that is ILS specific is the data that drives it

Shawn Boyette [Mon, 27 Jul 2009 16:06:22 +0000]
MDMP require version bumped
believe non-holding multis work now
null non-holdings data handled better

Shawn Boyette [Mon, 27 Jul 2009 16:06:21 +0000]
1.005
added 'ignoremulti' modifier
mods handling slightly tweaked

Shawn Boyette [Tue, 14 Jul 2009 03:07:46 +0000]
newline at end of sample run

Shawn Boyette [Mon, 13 Jul 2009 18:14:12 +0000]
--marcfile no longer gets -m; --map does

Shawn Boyette [Thu, 9 Jul 2009 16:29:41 +0000]
autoset marcfile on prefix

Shawn Boyette [Thu, 9 Jul 2009 14:24:34 +0000]
oopses

Shawn Boyette [Thu, 9 Jul 2009 14:09:18 +0000]
redo

Shawn Boyette [Thu, 9 Jul 2009 14:09:17 +0000]
redoing previous changes

Shawn Boyette [Thu, 9 Jul 2009 13:56:41 +0000]
rescued changes

Shawn Boyette [Thu, 25 Jun 2009 17:31:51 +0000]
yanking DBM::Deep stuffs

Shawn Boyette [Thu, 25 Jun 2009 17:31:50 +0000]
removing DBMD, phase I

Shawn Boyette [Wed, 24 Jun 2009 19:20:19 +0000]
old2new generation now handles controlfields and will map any number of matches, not just one-per-record

Shawn Boyette [Wed, 24 Jun 2009 19:20:19 +0000]
tweaks

Shawn Boyette [Wed, 24 Jun 2009 19:20:19 +0000]
typo fix

Shawn Boyette [Wed, 24 Jun 2009 19:20:18 +0000]
improved multifile handling

Shawn Boyette [Wed, 24 Jun 2009 19:20:18 +0000]
multi extractions work now. holding seq# always included now

Shawn Boyette [Wed, 24 Jun 2009 19:20:17 +0000]
making sampling work better

Shawn Boyette [Wed, 24 Jun 2009 19:20:17 +0000]
added subs thresholding

Shawn Boyette [Wed, 24 Jun 2009 19:20:16 +0000]
version bump

Shawn Boyette [Wed, 24 Jun 2009 19:20:16 +0000]
fixes for multi subfields
added get_multis method

Shawn Boyette [Wed, 24 Jun 2009 19:20:15 +0000]
autoflush turned off for DBM::Deep
name method doesn't require rec# anymore
{names} gone from record dataset

Shawn Boyette [Thu, 18 Jun 2009 18:23:37 +0000]
test for existing DBMD file; die

Shawn Boyette [Thu, 18 Jun 2009 18:23:36 +0000]
set default window size to 9 lines

Shawn Boyette [Thu, 18 Jun 2009 18:23:35 +0000]
holdings dump status msg now counts holdings instead of records

Shawn Boyette [Fri, 12 Jun 2009 18:30:37 +0000]
works with new MDMP

Shawn Boyette [Fri, 12 Jun 2009 18:30:36 +0000]
take rcnt out of dstore and call it reccount

Shawn Boyette [Fri, 12 Jun 2009 18:30:36 +0000]
changed rptr to rcnt

Shawn Boyette [Fri, 12 Jun 2009 18:30:36 +0000]
args handling bug

Shawn Boyette [Fri, 12 Jun 2009 18:30:35 +0000]
Keeping MDMP from eating all teh rams (WIP)

Shawn Boyette [Fri, 12 Jun 2009 18:30:34 +0000]
scope tweaking

Shawn Boyette [Fri, 12 Jun 2009 18:30:34 +0000]
1.003 No longer purely OO, to allow XML::Twig to run in handler mode (cuts memory usage from 2G+ to ~25M for full sampling)

Shawn Boyette [Fri, 12 Jun 2009 18:30:33 +0000]
test fixes for 1.003

Shawn Boyette [Wed, 10 Jun 2009 16:54:28 +0000]
changelog

Shawn Boyette [Wed, 10 Jun 2009 16:54:27 +0000]
1.002 if any instance of a sub has a value, that should now appear in detail sample output

Shawn Boyette [Fri, 29 May 2009 21:14:33 +0000]
subfield report added to detail

Shawn Boyette [Fri, 29 May 2009 21:14:32 +0000]
subfield length tracking implemented

Shawn Boyette [Wed, 27 May 2009 17:01:29 +0000]
1.001 changed sample detail format to one which can be dumped directly into a dokuwiki page

Shawn Boyette [Wed, 27 May 2009 15:35:12 +0000]
multi hash now uses just code for key

Shawn Boyette [Wed, 27 May 2009 15:35:11 +0000]
cruddy temporary fix while multi is sorted out

Shawn Boyette [Fri, 22 May 2009 14:47:20 +0000]
trying to fix memory problems, but barking up wrong tree

Shawn Boyette [Fri, 22 May 2009 14:47:20 +0000]
STL now prints file name on fail open

Jason Etheridge [Wed, 20 May 2009 03:35:33 +0000]
this is actually bearable with \set verbosity terse

Shawn Boyette [Wed, 13 May 2009 14:48:49 +0000]
improved error reporting

Shawn Boyette [Wed, 13 May 2009 14:48:48 +0000]
fix for undef value in regex

Shawn Boyette [Thu, 7 May 2009 16:17:51 +0000]
cleanup before new work

Shawn Boyette [Wed, 6 May 2009 20:59:12 +0000]
fixes

Shawn Boyette [Wed, 6 May 2009 20:58:45 +0000]
works (pretty much)

Shawn Boyette [Tue, 5 May 2009 20:27:38 +0000]
tmap updates

Shawn Boyette [Tue, 5 May 2009 16:43:45 +0000]
sample stuffs working pretty good now

Shawn Boyette [Tue, 5 May 2009 16:32:18 +0000]
fixes to tag counts

Shawn Boyette [Tue, 5 May 2009 15:12:05 +0000]
misc

Shawn Boyette [Tue, 5 May 2009 15:00:01 +0000]
sample routines basically working

Shawn Boyette [Thu, 30 Apr 2009 14:33:52 +0000]
fergot semicolon derrrrr

Shawn Boyette [Thu, 30 Apr 2009 14:33:08 +0000]
doc updates

Shawn Boyette [Thu, 30 Apr 2009 01:49:35 +0000]
filters in MDMP

Shawn Boyette [Thu, 30 Apr 2009 01:33:00 +0000]
single-chunk tokens being quoted is ok now. misc other fixes

Shawn Boyette [Wed, 29 Apr 2009 23:43:01 +0000]
initial checkin

Shawn Boyette [Wed, 29 Apr 2009 20:01:38 +0000]
yanking bib references; adding tag map to MDMP; cleanup and new test corpus data

Shawn Boyette [Wed, 29 Apr 2009 15:51:26 +0000]
when looking for date in 260$c, non-numerics are now removed to improve quality of data

Mike Rylander [Wed, 29 Apr 2009 15:22:08 +0000]
adding marcxml namespace to the wrapping collection element