I have a requirement to extract study titles from clinical documents in PDF and MS Word formats. There is no reliable pattern to the text or the formatting of the titles, so my options for direct querying are limited.
Are there any entity enrichment tools which might help to identify study titles in the clinical-document domain? Temis Luxid looks promising, but I have not been able to locate the Samples directory on my MarkLogic AWS image, so I don't know how to get started with that option. Bob
_______________________________________________ General mailing list [email protected] Manage your subscription at: http://developer.marklogic.com/mailman/listinfo/general
