Extracting information from English text

We perform three manipulations on programmes: splitting multipart programmes into their constituents, marked with the 'clumpidx' attribute; filling in the programme category based on the description; and parsing the description to find the names of actors.
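
As a rough illustration of these three steps, here is a minimal Python sketch. It is not how tv_extractinfo_en is implemented: the dictionary layout, the keyword table, the ';' title separator, the "starring" pattern and all function names are assumptions made up for the example. Only the 'clumpidx' values follow the XMLTV convention of "index/total".

```python
import re

# Hypothetical keyword table; the real tool uses its own, much richer rules.
CATEGORY_KEYWORDS = {
    "film": "film",
    "documentary": "documentary",
    "comedy": "comedy",
}

# Assumed phrasing: a list of names introduced by the word "starring".
STARRING_RE = re.compile(r"\bstarring\s+([^.]+)", re.IGNORECASE)


def split_multipart(programme):
    """Split a title such as 'News; Weather' into parts sharing one timeslot."""
    titles = [t.strip() for t in programme["title"].split(";") if t.strip()]
    if len(titles) < 2:
        return [dict(programme)]
    parts = []
    for index, title in enumerate(titles):
        part = dict(programme, title=title)
        # clumpidx records which part of the shared slot this is, e.g. "0/2".
        part["clumpidx"] = f"{index}/{len(titles)}"
        parts.append(part)
    return parts


def guess_category(programme):
    """Fill in a missing category from keywords in the description."""
    if programme.get("category"):
        return programme
    desc = programme.get("desc", "").lower()
    for keyword, category in CATEGORY_KEYWORDS.items():
        if re.search(r"\b" + re.escape(keyword) + r"\b", desc):
            return dict(programme, category=category)
    return programme


def find_actors(programme):
    """Pull actor names out of a 'starring A, B and C' phrase."""
    match = STARRING_RE.search(programme.get("desc", ""))
    if not match:
        return programme
    names = [n.strip() for n in re.split(r",|\band\b", match.group(1)) if n.strip()]
    return dict(programme, actors=names)


def extract_info(programme):
    """Apply the three manipulations in order to one programme."""
    return [find_actors(guess_category(p)) for p in split_multipart(programme)]
```

Calling extract_info on a programme whose description reads "Comedy starring Alice Smith, Bob Jones and Carol White." would, under these assumptions, set the category to "comedy" and record three actor names. Real listings are far messier, which is why the actual filter is specific to English text.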

(At this point the speaker shall perform a haphazard and probably broken demonstration by running tv_extractinfo_en on some input and diffing the results.)

Edward Avis
Last modified: Thu Mar 14 11:45:04 GMT 2002