Perrin, Andrew J. CodeRead. [Computer Program]. Available:
http://www.unc.edu/~aperrin/CodeRead
@MISC{coderead,
AUTHOR="Andrew J. Perrin",
TITLE="CodeRead (Computer Program)",
NOTE="Available: {\url{http://www.unc.edu/~aperrin/CodeRead}}",
YEAR=2001
}
There is, in addition, a frontend script that remains VERY basic. This script makes it possible for non-programmers to get somewhere (although it's not clear where) using CodeRead. The script is called cr.pl and should usually be run from the command line while in the directory where CodeRead is installed. You can type cr.pl -h to get a list of options for using the script. It allows basic, but not very fine-grained, use of the tools. More will be forthcoming, I promise.
Perl will run on pretty much any platform you can think of. However, CodeRead uses the qr// construct, which means the current release of MacPerl probably won't work. I will be fixing that problem, but it's very possible that MacPerl will be updated before I get around to it. Unix users can get perl from www.perl.com if they don't already have it, and Windows users can get it from ActiveState. You must have version 5.005 or higher in order for CodeRead to work.
You can get it here.
CodeRead is available as a gzipped tarfile, which can be opened using most decompression software including gnu tar, winzip, and stuffit expander. Contact me if you have problems opening it.
Generally, CodeRead should be in its own directory, called cr or anything else you want to call it. Files should be placed as follows:
CodeRead.pm cr.pl
Project.pm Textchunk.pm Codechunk.pm Utils.pm
QCML will be released in three distinct parts:
QCML is based on SGML (ISO 8879), the Standard Generalized Markup Language.
Tags should be infinitely levelable; i.e.
Two types of codes: temporal markers have no text within them, content codes encapsulate one or more words.
Content codes last until <\QC > tag.
System maintains character number and total number of characters as built-in temporal markers.
Each tagged section of text gets a score, which is just the number of characters (words?) between
Output:
Some tags:
Technical:
Words are numbered by system. Two data structures are created: