Thesaurus configuration
A thesaurus is a book that lists words grouped together according to similarity of meaning (containing synonyms and sometimes antonyms), in contrast to a dictionary, which contains definitions and pronunciations.
In information science, library science, and information technology, specialized thesauri are designed for information retrieval. They are a type of controlled vocabulary used for indexing or tagging. For more information about the term, see http://en.wikipedia.org/wiki/Thesaurus.
OpenKM can work with thesauri defined in .owl and .rdfs formats.
Protégé (http://protege.stanford.edu/) is a useful tool for creating and maintaining your own thesaurus. Specialized thesauri can also be found on the Internet, for example Agrovoc (FAO), the NASA Thesaurus, the UNESCO Thesaurus, and the Public administration Thesaurus. Some of them are free and some have usage restrictions.
To configure OpenKM to use a thesaurus, set the following configuration properties:
Property | Description |
---|---|
kea.thesaurus.owl.file | The ontology file (Web ontology file), for example vocabulary/file.owl |
kea.thesaurus.base.url | The base URL, for example http://www.someweb.org |
kea.thesaurus.tree.root | Query to get the first node, for example SELECT DISTINCT UID, TEXT FROM {UID} Y {OBJECT}, {UID} rdfs:label {TEXT} ; [rdfs:subClassOf {CLAZZ}] where not bound(CLAZZ) |
kea.thesaurus.tree.children | Query to get the child nodes, for example SELECT DISTINCT UID, TEXT FROM {UID} rdfs:subClassOf {CLAZZ}, {UID} rdfs:label {TEXT} where xsd:string(CLAZZ) = |
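As a rough illustration of the properties listed above, the following Python sketch checks that a configuration contains all four of them. It is not part of OpenKM: the key=value layout of the sample text and the helper names parse_properties and missing_thesaurus_properties are assumptions made for this example. Only the property names and sample values come from the table.

```python
# Property names taken from the table above.
REQUIRED = (
    "kea.thesaurus.owl.file",
    "kea.thesaurus.base.url",
    "kea.thesaurus.tree.root",
    "kea.thesaurus.tree.children",
)


def parse_properties(text):
    """Parse simple key=value lines, skipping blank lines and # comments.

    Assumption: the configuration is stored as plain key=value text.
    """
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the first "=" only, so values may contain "=" themselves.
        key, sep, value = line.partition("=")
        if sep:
            props[key.strip()] = value.strip()
    return props


def missing_thesaurus_properties(props):
    """Return the required thesaurus properties that are absent or empty."""
    return [name for name in REQUIRED if not props.get(name)]


# Hypothetical excerpt; kea.thesaurus.tree.children is left out on purpose
# so that the check has something to report.
SAMPLE = """\
# thesaurus settings (illustrative values from the table)
kea.thesaurus.owl.file=vocabulary/file.owl
kea.thesaurus.base.url=http://www.someweb.org
kea.thesaurus.tree.root=SELECT DISTINCT UID, TEXT FROM {UID} Y {OBJECT}, {UID} rdfs:label {TEXT} ; [rdfs:subClassOf {CLAZZ}] where not bound(CLAZZ)
"""

print(missing_thesaurus_properties(parse_properties(SAMPLE)))
```

Run against the sample, the check reports kea.thesaurus.tree.children as the only missing property. A check like this can catch an incomplete thesaurus setup before OpenKM is started, but it does not validate the queries themselves.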
For a full configuration example, see Creating your own thesaurus.
Some interesting URLs: