Meta tags:
Headings (most frequently used words):
resources, collatex, workshop, oo, col, computer, supported, collation, with, description, and, goals, instructors, schedule, general, installation, data, other,
Text of the page (most frequently used words):
the (22), collatex (18), and (14), #collation (14), unit (13), from (9), xml (7), workshop (7), for (6), with (6), python (6), computer (5), local (5), part (5), text (5), installation (5), university (5), digital (4), supported (4), tokens (4), markup (4), information (4), github (4), output (4), source (4), windows (4), install (4), break (4), building (4), how (4), van (3), humanities (3), gothenburg (3), resources (3), using (3), ipython (3), this (3), plain (3), tei (3), critical (3), witnesses (3), instructions (3), levenshtein (3), catered (3), foyer (3), participants (3), 2015 (3), ronald (2), haentjens (2), dekker (2), joris (2), zundert (2), project (2), model (2), general (2), processing (2), differently (2), according (2), recognizing (2), tracking (2), during (2), normalization (2), notebook (2), step (2), tutorial (2), creating (2), watch (2), space (2), input (2), files (2), use (2), simple (2), collating (2), options (2), alignment (2), table (2), variant (2), graph (2), parallel (2), segmentation (2), apparatus (2), tokenization (2), other (2), are (2), data (2), file (2), bit (2), python_levenshtein (2), cp34 (2), none (2), whl (2), binaries (2), microsoft (2), can (2), you (2), have (2), that (2), coffee (2), lunch (2), automated (2), huygens (2), ing (2), david (2), birnbaum (2), must (2), own (2), will (2), western (2), sydney (2), acknowledgements, usb, sticks, generously, contributed, koala, keyboard, image, buzzfeed, exist, dirk, hulle, gregor, middell, vincent, neyt, vol, 2014, literary, linguistics, computing, journal, scholarship, modern, manuscripts, beckett, manuscript, modular, architecture, aided, terminology, glossary, main, web, site, units, unicode, reading, multiline, revising, code, classes, collated, powerpoint, slides, oxford, archive, subdirectory, 2499, download, zip, partonopeus, blois, six, versions, derived, charles, darwin, origin, species, procedure, described, detail, our, win_amd64, win32, users, who, unable, library, these, precompiled, instead, mirrored, http, www, lfd, uci, edu, gohlke, pythonlibs, pip, pre, upgrade, already, installed, make, sure, most, recent, version, running, applications, showcase, refining, environment, command, line, hierarchical, system, theory, copy, schedule, leif, jöran, olsson, pittsburgh, bern, tara, andrews, lead, instructor, instructors, bring, their, laptops, preparation, see, links, below, prior, programming, experience, required, teach, open, tool, compare, automatically, way, used, produce, textual, editions, types, comparative, documents, learn, prepare, materials, any, written, script, perform, inspect, modify, results, one, day, annual, international, conference, alliance, organizations, adho, hosted, through, july, takes, place, monday, room, 109, your, parramatta, south, campus, dh2015, global, description, goals, 2026, 07t17, 0000, last, modified, djbpitt, gmail, com, maintained, col,
Text of the page (random words):
computer supported collation with collatex oo col computer supported collation with collatex maintained by david j birnbaum djbpitt gmail com last modified 2026 02 07t17 14 04 0000 description and goals the one day computer supported collation with collatex workshop is part of dh2015 global digital humanities the annual international conference of the alliance of digital humanities organizations adho hosted by the university of western sydney from 2015 06 03 through 2015 06 03 july the workshop takes place on monday 2015 06 29 in building ea room 109 at the university of western sydney parramatta south campus from 9 30 4 30 with a lunch break on your own from 12 30 1 30 this workshop will teach participants how to use the open source collatex collation tool to compare witnesses of a text automatically in a way that can be used to produce critical textual editions and other types of comparative documents participants will learn how to prepare source materials in any written script for collation how to perform automated collation using collatex and how to inspect and modify the results participants must bring their own laptops and must install python 3 and collatex in preparation for the workshop see the links to installation instructions below no prior python programming experience is required instructors ronald haentjens dekker huygens ing lead instructor tara andrews university of bern david j birnbaum university of pittsburgh leif jöran olsson university of gothenburg joris van zundert huygens ing schedule 9 30 10 10 unit 1 theory of collation gothenburg model automated collation local copy 10 10 10 45 unit 2 collatex environment ipython the command line the hierarchical file system 10 45 11 15 coffee break catered ea building foyer 11 15 11 50 unit 3 witnesses tokens tokenization 11 50 12 30 unit 4 collating plain text output options alignment table variant graph tei parallel segmentation critical apparatus 12 30 1 30 lunch break catered ea building foyer 1 30 2 10 unit 5 using collatex with xml recognizing and tracking markup information during collation 2 10 2 45 unit 6 refining the collation normalization 2 45 3 15 coffee break catered ea building foyer 3 15 3 50 unit 7 processing tokens differently according to markup information 3 50 4 30 unit 8 applications of collatex workshop project showcase workshop resources installation python 3 and collatex installation instructions if you have already installed collatex make sure that you have the most recent version by running pip install pre upgrade collatex python levenshtein binaries for microsoft windows users of microsoft windows who are unable to install the python levenshtein library from source can install these precompiled binaries instead mirrored from http www lfd uci edu gohlke pythonlibs python levenshtein python_levenshtein 0 12 0 cp34 none win32 whl for 32 bit windows python_levenshtein 0 12 0 cp34 none win_amd64 whl for 64 bit windows the installation procedure is described in detail in our installation instructions data charles darwin the origin of species six versions xml and plain text derived from collatex source partonopeus de blois oxford text archive download zip file tei xml files are in 2499 data xml subdirectory other workshop resources unit 3 witnesses tokens tokenization github local unit 4 collating plain text output options alignment table variant graph tei parallel segmentation critical apparatus github local units 5 6 using collatex with xml recognizing and tracking markup information during collation powerpoint slides unit 5 unit 6 general tutorial ipython notebook step by step tutorial part 1 creating simple collated output from simple xml input part 2 revising the code to use classes part 3 reading multiline input from files watch this space part 4 creating xml output watch this space unicode normalization ipython notebook github local unit 7 processing tokens differently according to markup information github local general collatex resources main collatex web site glossary of collatex and collation terminology the gothenburg model a modular architecture for computer aided collation computer supported collation of modern manuscripts collatex and the beckett digital manuscript project ronald haentjens dekker dirk van hulle gregor middell vincent neyt and joris van zundert literary and linguistics computing the journal of digital scholarship in the humanities vol 25 2014 03 19 1 19 acknowledgements usb sticks generously contributed by exist db koala keyboard image from buzzfeed
|