|Document Type:||Article, Computer File, Internet Resource|
Edward T O'Neill; OCLC. Office of Research.
|Notes:||Title from title screen (viewed Mar. 26, 2004).|
|Details:||Mode of access: World Wide Web.|
|Responsibility:||project manager, Edward T. O'Neill.|
The primary goal of the study was to develop software that could cluster all manifestations of a work of English-language fiction. A work is defined as the set of related texts that share a common origin and content. To associate individual bibliographic entries with their corresponding work, we extended string-matching algorithms originally developed for duplicate detection.
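
As a rough illustration of clustering by string matching, the sketch below groups bibliographic entries under a normalized author/title key. It is not the study's actual algorithm, which this record does not describe in detail. The record fields (`id`, `author`, `title`), the function names, and the normalization rules are all assumptions made for the example.

```python
import re
import unicodedata
from collections import defaultdict


def normalize(text):
    """Lowercase, strip accents and punctuation, and collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", text)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    cleaned = re.sub(r"[^a-z0-9 ]+", " ", no_accents.lower())
    return " ".join(cleaned.split())


def work_key(record):
    """Build a hypothetical matching key from the author and title fields."""
    return (normalize(record["author"]), normalize(record["title"]))


def cluster_by_work(records):
    """Group record ids whose normalized author/title keys are identical."""
    clusters = defaultdict(list)
    for record in records:
        clusters[work_key(record)].append(record["id"])
    return dict(clusters)


# Illustrative sample data, not taken from the study.
records = [
    {"id": 1, "author": "Austen, Jane", "title": "Pride and Prejudice"},
    {"id": 2, "author": "AUSTEN, JANE.", "title": "Pride and prejudice /"},
    {"id": 3, "author": "Austen, Jane", "title": "Emma"},
]
clusters = cluster_by_work(records)
```

Exact matching on a normalized key catches only differences in case, accents, and punctuation. Grouping translations, variant titles, or differing forms of an author's name would need additional rules or approximate matching.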