|文件類型：||文章, 電腦資料, 網路資源|
Edward T O'Neill; OCLC. Office of Research.
|注意：||Title from title screen (viewed Mar. 26, 2004).|
|詳述：||Mode of access: World Wide Web.|
|責任：||project manager, Edward T. O'Neill.|
The primary goal of the study was to develop software which could cluster all manifestations of a work of English language fiction. A work is considered the set of related texts that have a common origin and content. To associate individual bibliographic entries with the corresponding work, we extended string matching algorithms originally developed for duplicate detection.