Abstract
In this article, we evaluate approaches using logic reasoning applied to an ontology for literary characters. The inference tool Racer and the programming language Prolog were tested and compared to see if they can serve as a computer assisted approach in this scenario.
Both offer options to draw inferences, but the usage requests a good comprehension of logics. Intuitive and individual queries are also difficult to produce with solely logic constructs. Furthermore, information in humanities is often vague, ambiguous, or even contradictory. Solving such problems by logic reasoning which is simply based on true or false conclusions will become challenging and might exceed the limits of logic reasoning.
At the moment, to the author’s knowledge, only few such approaches, especially in literature studies, have been published. Existing approaches show promising results in modelling information in humanities. Therefore, further research should be directed to make ontologies and further approaches like logic reasoning even more popular and applicable in the humanities and literature studies.
Introduction
Information seems to be essential for our lives. Humans get information everywhere,
not only on the Internet, TV or radio channels and books, but also by talking to
people or looking at road signs. We are constantly faced with problems and by
solving these problems, our ability of using information is requested. But do we
always find a solution just in time? And do we know when and where we can use
information correctly? In daily life, we have to get information as quickly as
possible. The Internet is a "mine" of information, but, like in a
real mine, we need to light the darkness, to navigate by maps so that we are able to
dig at the right point. The Internet is a giant source of information but often
completely unstructured, leading many to the idea of structuring and grouping
information presented on the Internet, the
Semantic
Web. Since the introduction of Semantic Web technologies and standards,
like
RDF and
OWL,
ontologies and other related methods have entered many research fields. In life
science, they are already established, e.g. GeneOntology [
Stevens et al. 2000]. Ontologies belong to approaches of information or
knowledge representation. Now, this technology has been also introduced into several
projects in the humanities, e.g. GOLD [
Farrar et al. 2002], WordNet [
Gomez 2004], TermNet [
Lüngen 2007], FRBR [
Renear et al 2006], and DISCOVERY [
Smith 2007]. GOLD is an ontology for descriptive linguistics, e.g. syntax, morphology or linguistic data structures. WordNet is a lexical database of English that is also presented as an ontology. Thereby, TermNet, a terminological net for the domain of text technology, takes the concepts of WordNet into account. While these are linguistic ontologies, the FRBR ontology defines concepts of bibliographic cataloguing and indexing. Based on domain ontologies and other data bases, the DISCOVERY project links different philosophical projects together.
But does the obvious popularity of ontologies really point at the advantages? Or is the popularity only a trend in research?
There are some criticisms of ontologies [
Veltman 2004], [
Shirky 2005]. Veltman points out that the definition of meaning is often rather limited when using ontologies, especially in humanities. He argues that this definition is adequate for machine-to-machine transactions in, for example, business, but it does not address the needs of culture. There, meaning comprises much more and needs to be handled by more suitable methods. In addition, Shirky criticizes the incomplete methods of classifying things. Summarized, he argues that categorization is always dependent on world views, which might not be accepted by other parties. Both authors hint at very important points, which should be discussed critically. But that goes beyond the scope of this contribution.
Nevertheless, the results of the approaches mentioned before show that ontologies and
supporting technologies like logic reasoning are useful ways to express structured
and hierarchical information in the humanities. In literature studies, the usage of
ontologies is not very common. A reason for this might be that in this field, the
contribution of an ontology to research topics is often rather difficult.
Definitions of literary phenomena are unstable and unclear. Furthermore, in
humanities and especially in literature studies, topics of research often cannot be
measured like in life science [
Zöllner-Weber & Apollon 2008]. As such it seems to be ever more challenging to introduce ontologies to these research areas, and, one might ask, which kinds of applications are offered and whether benefits might be drawn when applying such technologies.
Since their initiation in philosophy, ontologies, with their methods of subsumption
related to semantics, have been expressed by logical expressions. Thereby,
information about an ontology can be obtained using logic reasoning [
Baader et al. 2003]. By formulating logic expressions, information that is
often implicit or hidden in the ontology can be queried and conclusions can be
drawn. This means that information about the structure and elements of ontologies
can be retrieved, which can be of interest for human users, i.e. manual processing
and automated processing, respectively. For example, even for human users, it might
be difficult to get a detailed overview of relations between the elements in large
ontologies without appropriate methods and utilities. Another interesting aspect
might arise if ontologies from different sources are merged or mapped [
Noy 2003, 18–9]. A merging or mapping may result in an enlarged description of the ontology domain.
Such a method is also of interest for schemes and models of topics in the humanities.
Semantic relations that are essential in humanities can be expressed by creating an
ontology. After storing information in an ontology, consecutive processing, like logic reasoning, creates further possibilities like consistency checks or support by merging data.
The aim of this work is to investigate probable applications for ontologies in the humanities, especially for literature studies. As mentioned, it is rather challenging to work with ontologies in humanities. In this contribution, an example ontology and its supporting tools, here focused on logic reasoning, are reviewed on three levels: a) the user’s view, b) the methodological limits and c) limits of the tools with respect to literature studies.
In Section 2.1, an introduction to ontologies, especially the way they are used in
Artificial Intelligence, is given, and their connection to logic reasoning is
outlined. In Sections 2.2 and 2.3, an ontology for the description of literary
characters and the application of logic reasoning to this ontology is presented
[
Zöllner-Weber 2006]. In Section 3, logics related to ontologies are introduced.
Afterwards, the ontology for literary characters is used to test different
logic-based methods; two tools,
Racer and
Prolog, are compared.
Racer represents pure inference machines, whereas
Prolog has been taken as an example for logic programming languages. In Section 4, the ontology and the applied logical reasoning tools regarding the aforementioned points are discussed. Finally, a conclusion and an outlook are given.
Materials and Methods
In the next sections, an introduction to ontologies and especially to the ontology for literary characters is given.
Introduction to Ontologies
Briefly, ontologies are used to provide organisation and retrieval of information
semantically. Thereby, similarities between information provided by an ontology are searched by means of a semantic relation rather than by matching search strings or other similar measures. There are two comprehensions of the term of ontology: the pure formalistic approach of classifying objects as in the field of Artificial Intelligence and machine learning, and the more transcendental approach in the humanities and especially philosophy.
The term ontology, which originates from philosophy and contains the study of
being or existence in general, was introduced by Aristotle [
Philosophie-Lexikon 1991]. Aristotle was interested in describing the existence of things in the
world. He asked fundamental questions like "What is
existence?" On the one hand, he discussed human existence as it is
still discussed in philosophy, and on the other hand, he developed a system
(universals and particulars) of how the existence can be described, a formalism.
In particular, this is based on descriptive logics. The system of these logics
is contained in an ontology, which inherits formalistic principles, and
therefore is a form of expressing logics. In the ancient world and maybe in
Aristotle’s view, the different sciences did not exist separately. In today’s
view, he mingled transcendental questions with pure formalism. Over the course
of centuries, these two parts have been split into different sciences. In a
simplified way, formalism and classification have been moved to natural science
and mathematics while the transcendental core of Aristotle’s questions is
discussed in philosophy and theology. By transferring the term of an ontology to
the AI, it is more restricted and focused on modelling of concepts of the real
world in computer systems. Gruber gives a definition: "An ontology is a formal, explicit specification of a shared
conceptualisation"
[
Gruber 1993, 199]. In
this approach, Gruber refers to groups of people with a common comprehension of
the world. In that respect, the initial question of existence is no longer in
focus but still has to be described. Therefore, the formalism relied upon in an
ontology is used. An ontology in AI comprises hierarchically structured
concepts, also called classes, of a part of the world (a
domain).
The representation should be produced in a machine-readable language. Noy et al.
explain, "An ontology defines a common vocabulary for
researchers who need to share information in a domain"
[
Noy et al. 2001, 1]. An ontology consists of a set of objects
that are divided into classes, concepts, properties (also slots, or roles) and
the restrictions of the roles [
Noy et al. 2001, 3]. By relating main
and more specialized classes to an ontology, a hierarchy can be created.
Additionally, so-called instances of the classes represent individual objects of
the selected domain. Although, the structure of an ontology is rather static,
the included information can be queried and manipulated in several ways, e.g.
using W3C standards like the
Web Ontology Language
(OWL) or
Resource Description Framework Schema
(RDFS) [
Antoniou & Harmelen 2003]. OWL has been created to enlarge web sites
with semantic information and to make the Internet usable as a structured
information source [
Ziegler 2004, 126].
Discussion about ontologies, in this contribution, is focused on the formalistic method. Therefore, the discussion of existence, which has a long tradition in humanities, is not relevant here because an ontology is just seen as a very useful method to structure phenomena of literature studies. In the following, the ontology, criteria, and tools are presented.
An Ontology for the Description of Literary Characters
To investigate ontologies and logic reasoning as tools in humanities, a concrete
example is used. Here, the description of literary characters has been realised as an ontology.
Briefly, several theories of literary characters [
Jannidis 2004], [
Nieragden 1995] are combined to create a base of
a formal description using an OWL ontology [
Zöllner-Weber & Witt 2006]. Categories which describe features and actions of
characters mentioned in stories had to be adapted and applied. It is aimed at
representing the mental information structure of a reader, which (s)he has in
mind when reading a book [
Jannidis 2004, 185]. Categories
describing general aspects of literary characters form the main classes of the
ontology, e.g. inner and outer features, actions on other characters and
objects. In this contribution, characters are regarded as individual objects,
which are not linked to roles or archetypes [
Propp 1968], [
Greimas 1971]. Asking what a character means to a reader means not
what kind of characters we have, but what general structure and attributes a
character has. We regard a character as a complex cognitive entity in the
reader’s mind, rather than emphasizing metaphors or archetypes. In this
contribution, approaches that separate characters by roles, genres, etc., are
fazed out, because they might superimpose literary criticism on the daily
phenomenon of reading literature. Reading literature and understanding
characters does not necessarily require that everyone be a scholar in literature
studies. It is of interest how a normal person as well as a scholar describes a
character. By using a hierarchy, individual descriptions of characters should
get a common ground to comparing these descriptions. By using sub classes,
categories can subsume features of special characters or groups of characters.
In addition, so-called instances of the classes represent individual and
explicit objects of the domain of literary characters. Here, direct information
about a character given in a text is assigned to an instance. Together with the
information of the class hierarchy and other instances, a single mental
representation of a character is modelled (cf.
Figure 1). In this
approach, individual description, the pre-step of interpretation, is focused.
Finding the same patterns revealed in different descriptions might be regarded
as a common sense. By analysing these, similarities based on the cultural
background of readers or of writing and reading traditions might be found. A
more detailed description of this ontology is given in [
Zöllner-Weber 2006], [
Zöllner-Weber & Witt 2006].
Applications using the Ontology for Literary Characters
Having explained the concrete implementation of the ontology of literary characters, we focus this section on logic reasoning, which is a central principle when dealing with ontologies.
In general, conclusions drawn from logic reasoning can be inferred from given
information. This means implicit information is derived from explicit
information. In formal logic, given information is called assumption; the
operation is called conclusion. The methods induction [
Charniak & McDermott 1985, 22] and deduction [
Charniak & McDermott 1985, 14] are subsumed under the term of logic reasoning. Induction means inferring from special concepts to general concepts whereas deduction constitutes the opposite process from general concepts to special ones. Here, the conclusions can only receive the values true or false. The assumptions of the induction have to be true so that a conclusion can be constituted as true as well. The conclusion is called an inference.
Logic reasoning is one possible application for ontologies. It is probably
helpful (i) to check consistency during ontology development, (ii) to enable
semi-automatic merging of (domain) ontologies as well as (iii) to deduce hidden
information contained in the ontology. These three tasks can be applied to all
elements of ontologies, classes as well as instances. As Baader et al. mention,
logic reasoning can fulfill different purposes in the phase of creating an
ontology and in the phase of using it [
Baader et al. 2003, 4], e.g. investigating the structure of categories/ concepts, or testing if every object is used in the intended and not contradictory way. In different situations of a work process, logic reasoning can be used to avoid or to solve problems: If several persons build together an ontology, new included elements can be checked for inconsistency or redundant information can be detected.
Coming back to the mentioned mapping processes, one can state that for the ontology of literary characters, it might be useful, if two persons add categories with the same or similar meaning to the ontology, to produce suggestions for mapping. Applications like mapping or merging are useful for the development and usage of ontologies. But recently, to the author's knowledge, there have been no other ontologies in literature studies which deal with literary characters or narratology and which could be related to this ontology, i.e. by merging or mapping. To be successful, synonymy has to be taken into account and a relating mechanism has to be implemented, because processing of synonymy is not included directly in an ontology so far. Therefore, these tasks cannot be applied completely to the ontology for literary characters at the moment.
Especially in the humanities, where textual data is often crucial, applications
can be used which combine ontologies with text annotations or information
retrieval. Ontologies consist of information, ideas, and/or facts, which also
occur or are repeated in (text) sources that are not included in an ontology.
This might be especially interesting when dealing with ontology learning. For
example, terms contained in both sources, in text annotations/web pages and an
ontology [
Baader et al. 2003, 4] can be queried for. But here
arises the question of whether arranging data that both sources can match would
be a redundant and time-consuming work. In addition, setting up more elaborate
match processes could require a complex system including lexica or external
rules [
Puppe et al. 2000, 635–6], [
Benjamins et al. 1999].
Nevertheless, logic reasoning is not only useful for such processes. Logic
reasoning can also help during users' orientation period. Hierarchical relations
can be highlighted so users can get a structured overview about all elements and their relations. Another aspect is obtaining information that is implicitly hidden in the data.
These examples show only some of the imaginable methods which might be performed by logic reasoning, but already they outline possibilities for an ontology in the humanities. It does not matter for which discipline ontologies are developed; they all have to pass a development, test, and deployment phase and thus, the methods of logic reasoning might be helpful.
However, mapping or merging is yet not possible because of the singularity of the
ontology for literary characters. Such operations might become possible if
further ontologies in this field will become available.
In addition, the human user should be focused when testing logic reasoning. In order
to test logic based applications, several tasks are defined, - which are similar to
the queries given in linguistic approaches [
Lüngen 2007]:
Retrieval of
-
Individuals/instances that belong to a given class.
The results should give information about the relationship between special classes and their instances so that it is possible to obtain information about a character, which is disseminated in a text, e.g. that characters consist of special features whose information is expressed as instances
-
Individuals/instances that contain a given property (and maybe a
given value).
The results should give information about the relationship between instances and special properties and to which class the instance belongs so that the apportionment of additional information can be focused, thereby it might be possible to examine which special kinds of properties only belong to some classes
-
Bottom up/top down.
The result should show the relationship of classes and its super classes, it should also show the arrangement of the features/actions of characters and the structure of the mental representations
Testing Applications of Logic Reasoning
OWL and its Relation to Description Logic
The ontology used in this contribution was implemented in
OWL
Description Logic
[
Antoniou & Harmelen 2003]. The standard OWL consists of
three sub languages (OWL Light, OWL DL, OWL Full) created on different levels of
information representation. Before going on, a short summary of the Description
Logic whose ideas are included in OWL DL: OWL is often used in its sub language OWL
DL, as it inherits concepts of a logic formalism called
Description Logic (DL). In DL, concepts are grouped in a
so-called
TBox and individuals belong to an
ABox. The structure of a TBox is represented as
subsumption so that more general concepts contain special ones, which correspond to
the concepts/classes of an ontology. Regarding the ontology for literary characters,
all classes that have been defined (cf.
Figure 2) are collected in the TBox. An ABox subsumes all instances of the modelled domain. This can be seen as a description of a domain. Coming back to the ontology for literary characters, this means that the ABox is composed of the instances contained in the ontology.
As Nardi and Brachman explain, these methods are usually a variant of first-order
predicate calculus (cf. [
Nardi & Brachman 2003, 2]). Therefore, it is
possible to use the mechanisms of logic formalism like reasoning or drawing
inferences. To operate on data that is formatted or transformed into Description
Logic, inference algorithms have been developed. These algorithms are
implemented in different programmes like FaCT++ [
Tsarkov & Horrocks 2006],
Pellet [
Sirin et al. 2007], and Racer [
Haarslev et al. 2004],
which are DL reasoners, or Prolog
http://www.swi-prolog.org/ which is also able to query OWL ontologies.
Racer and Prolog
In the next sections, the two already-mentioned tools, the inference machine Racer and the logic programming language Prolog, will be described.
Introduction to Racer
Here, we investigate whether Racer (
Renamed ABox and Concept
Expression Reasoner Professional) is a good tool for drawing inferences
on ontologies in the humanities. Racer was developed for approaches that are
based on OWL/ DL and offers inference functions as well as consistency checks
(cf. [
Racer Systems 2005, 1]).
According to the concepts of an ABox and TBox in DL, the classes and individuals
are presented and inferred separately (cf.
Figure 3). To infer in OWL, Racer
uses the query language
nRQL
[
Racer Systems 2005, 87]. This means that the notion in the XML
standard of OWL has to be translated to this query language. Queries in nRQL
consist of a head and a body similar to predicate calculus [
Haarslev et al. 2004].
The syntax for a TBox and ABox differs from each other. By an ABox query, for
example possible individuals of a special class can be retrieved. It is also possible to query for individuals with special properties. So, information nested in individuals and properties can be related to each other. In order to retrieve individuals with a data type property, so-called head projection operators have to be used. In a TBox, classes that do or do not have a given name can be also queried for.
Figure 3 shows how Racer is working in conjunction with an ontology. Except for the graphical notion of TBox and ABox, all queries in nRQL have to be inserted manually. The results are also shown in nRQL. The user has no graphical interface when querying in the ontology.
Another helpful option is to use a special construct that can be used in combination with the classes that should be retrieved. This means that direct hierarchical relations between classes like sub or super relations can be retrieved. But one should note that Racer can only offer a result if the classes are in a hierarchical relation to each other.
In general, when querying an ontology by Racer, different results can be obtained. First, a Boolean is returned (true or false), i.e. an assumption about the ontology can be verified. Second, a literal value is returned, depicting certain information queried for.
All these efforts are reasonable when dealing with huge or (partly) unknown ontologies. Logic reasoning using Racer might help a human to orientate himself/herself and to work easily with such an ontology. Figure 4 shows the processing of possible queries of an ABox or TBox.
Introduction to Prolog
Another possibility for drawing inferences is to use a programming language that
is specialized for logics, e.g. Prolog. It is based on first-order predicate
calculus [
Fisher 2006] and provides unification, backtracking, and tail recursion: operations that are especially useful for ontologies. Inferences can be drawn by using a parser that operates on a so-called
Prolog fact base. This means that, similar to Racer and its query language, OWL ontologies have to be transferred to a notion in Prolog.
Data in Prolog consists of facts and rules. The facts
can be considered a vocabulary on which Prolog works. Facts, called predicates, are defined with arguments included to describe the knowledge, and given to Prolog. A predicate is defined by giving a head and arguments (in brackets). By using rules, the fact base can be enlarged so that different relations and combinations between the modelled elements can be achieved.
Using Prolog as an inference tool for ontologies, the
SWI-Prolog implementation offers several useful modules
for manipulating RDFS and OWL data. In this approach, the RDFS module for
SWI-Prolog has been used
http://www.swi-prolog.org/pldoc/package/semweb.html. By loading an ontology, the module represents the data as triples in a Prolog fact base. The RDFS module can query for the relations between classes, properties, and individuals by using predefined predicates, e.g. relation between classes and instances. But one can also query for instances with a given property.
It is also possible to retrieve instances with a given literal, e.g. special, property value. For comparison reasons, the usage of literals like character names included in the ontology for literary characters is very useful. Thus, all the instances or pieces of information of two literary characters can be retrieved. But it might also be interesting to query for characters created by a special author. For example, a user wants to retrieve every instance with the name of a given author in its property. Information can be filtered. Hierarchical relations of classes, e.g. super-sub class relations can also be queried top down or vice versa. By combining the predicates of the module, a wide range of different queries can be supported.
Discussion
Having outlined the technical principles, in the following, we discuss the
application of logic reasoning to an ontology in literature studies on three levels:
a) the user’s view, b) the methodological limits including ontologies, and c) limits of the tools especially for literature studies.
The task defined in Section 2.2 could be realised by queries formulated in nRQL and Prolog.
Table 1 in the appendix summaries all types of queries implemented and executed so far. The query in nRQL depicted in Figure 5, as an example, as well as in
Table 2 in the appendix, demonstrates the possibilities of using logic reasoning.
Literally, it could be asked if a certain description of a character is featured by a
class (instance-class relation), combination of content of the primary text and narrative/ additional information (instance-property relation), and the hierarchical structure of a character (top-down relation).
Racer and Prolog differ in their basic concept of logics: Prolog is based on
predicate calculus whereas Racer uses the concept of description logic (ABox, TBox
syntax). Both tools have been evaluated in this work. In comparing Prolog and Racer, a huge difference between them is that SWI-Prolog cannot only be used for drawing inferences — the parser can handle all kind of data represented as a Prolog fact base. In contrast, Racer is a solely inference machine.
Both have their own representation and syntax so that a user who is querying an
ontology needs detailed knowledge on handling both programmes, i.e. to draw
conclusion with these tools, the plain representation of the ontology in OWL, an XML
standard, has to be transformed into the programmes’ own representation in advance.
Additionally, a possible user has to be familiar with the concepts of an ontology to
formulate reasonable queries. This assumes certain knowledge in theory in ontologies
and logic but also some technical "hands on" experience for daily use. It is questionable if these skills can be a requirement for a scholar with traditional education in literature studies without proper training. In summary, both programs are reasonable for inference tasks with ontologies in humanities. Both have advantages and disadvantages, especially formulating queries seem to be difficult. SWI-Prolog might be a choice since complete scripts can be executed. On the other hand, Racer provides a graphical representation of ontologies.
As mentioned, an ontology consists of a set of objects, concepts, and further
entities, which are related to each other [
Noy et al. 1997, 53]. These
objects are divided into classes, properties [
Noy et al. 2001, 3].
Additionally, instances of the classes represent individual objects of the selected
domain. By relating main classes and more specialised ones of an ontology, a
hierarchy can be created. Therefore, only information in a hierarchical order can be
modelled using an ontology. Similarities between concepts or classes commonly
occurring when modelling certain information from literature cannot be expressed
directly by an ontology. However, there exist certain structures within the OWL
languages that allow for such relations. But these relations have to be modelled
explicitly during the creation of the ontology. Concerning the description of
literary characters as in this work, theories which tend to have a hierarchical
structure for describing the characters as Nieragden [
Nieragden 1995] should be included. Theories that are based more on network-like structures, however, should be modeled using topic maps or similar techniques, for example.
Besides this disadvantage, one has also to note that vague information can be handled by the ontology through the instantiation of a class. While the class describes the more general and common agreed on concept, the instance refers to a concrete example, also including possible individual interpretations. This kind of modelling has been used for the ontology of literary characters.
In principle, drawing inferences can retrieve implicit and hidden information. The
structure of OWL DL allows logic and formal applications, but the scope of the
logic-based queries is often restricted. Since the logic theorem is based on the
closed world assumption, only information already included in a fact base can be
returned. Mostly the implicit information that possibly exists in an ontology is
anchored in the hierarchical relations between the classes and between classes and
instances. Furthermore, the employed logical concept in description logics only can
give back true or false conclusions (or the elements of the fact base that match a
true result). However, when it is queried for vague information (not modelled via
certain similarity constructs as explained before) this approach will fail since
they cannot be expressed in logical terms [
Berthold 2003].
Furthermore, such information is often ambiguous, or even contradictory [
Zöllner-Weber & Apollon 2008]. Here, fuzzy logics or statistically methods might be
superior. In computer science, many approaches combine logic reasoning with other
methods of information retrieval to overcome the limits of logics. Logical queries
are often embedded in systems, which contain further sources like lexica to extract
more information e.g. ontology learning [
Cimiano et al. 2006].
More generally, when comparing literature studies and computer systems, one has to
take into account that semantics in literature studies often do not match semantics
in machine-to-machine learning [
Veltman 2004]. Often, technical, e.g.
machine-to-machine, semantics are more flat and not too complex, whereas in
literature semantics could be highly complex. As an example in the present work, the
description of the literary characters is dependent on the reader’s interpretation.
Therefore, the modelling of the ontology must allow for several interpretations
side-by-side in the ontology (via instantiations). This is also covered in [
McCarty 2005], where he points out that proper modelling in humanities is important and that the model has to be kept open or flexible, i.e. to be able to include the thinking itself.
Furthermore, one has to note that the foundations i.e. the literary theories of
character descriptions, on which the ontology is built on, mainly have hierarchical
character and thus, are more advantageous for the proposed task. Nevertheless, also
the structure of a hierarchy is critical. Shirky notes that hierarchies are usually
built on a single world-view [
Shirky 2005]. But mostly the weighting or the level of
detail within different branches in the hierarchy is influenced by the modeller or
by economical aspects. Therefore, the hierarchy might not be objective. This might
not be a problem for machine-based processing; however, a user of this ontology might not agree on the presented hierarchy. This risks that the provided system will be abandoned by the user.
Conclusion and Outlook
In this article, we evaluated approaches using logic reasoning applied to an ontology for literary characters. The inference tool Racer and the programming language Prolog were tested and compared to see if they can serve as a computer assisted approach in this scenario.
Both offer options to draw inferences, but the usage requests a good comprehension of
logics. In addition, intuitive and individual queries are also difficult to produce
with solely logic constructs. Furthermore, the hierarchy model of an ontology might
often not be suitable for certain topics in literature studies. For example,
strands of plots that evolve over time cannot be represented by using a hierarchy.
For such analysis, other approaches are more suitable (for example [
Meister 1999]). One should also keep in mind that information in
humanities is often vague, ambiguous, or even contradictory. An example of such a
situation is given by the relations between language and the world in Ludwig
Wittgenstein’s philosophical work
Tractatus. If trying
to model these by an ontology (hierarchy), the author was faced by the fact that the
hierarchy changes during the progress of Wittgenstein’s approach [
Zöllner-Weber & Pichler 2007]. However, a model represented by an ontology remains static. Solving such problems by logic reasoning which is simply based on true or false conclusions will become challenging and might exceed the limits of logic reasoning.
Nevertheless, the potential of logic reasoning for literature should be explored. It is helpful in the different phases of the creation and deployment of ontologies in the humanities and its sub disciplines like literature studies. For example, when different experts are working on the same ontology, its consistency can be checked and errors can be minimised in an early state of an ontology.
Probably, when dealing with such information and logic systems, Fuzzy Logic should be
considered [
Halpern 2005]. It offers a range of values that weight a
conclusion rather than a binary decision as the logics described in this
contribution. Also, combinations with other types of systems, like additional knowledge bases, might support current ontology approaches in humanities and literature studies.
At the moment, to the author’s knowledge, only few such approaches, especially in
literature studies, have been published. Therefore, a general judgement about
the value of computer-aided approaches as outlined in this contribution could
not be demonstrated to its limits. Existing approaches (for example [
Farrar et al. 2002], [
Lüngen 2007], [
Zöllner-Weber & Witt 2006]) show promising results in modelling information in humanities. Therefore, further research should be directed to make ontologies and further approaches like logic reasoning even more popular and applicable in the humanities and literature studies.
Appendix
Table 1.
| Query for instance class relation: |
rdf(INSTANCE,'http://www.w3.org/1999/02/22-rdf-syntax-ns#
type',CLASS),
rdf(CLASS,'http://www.w3.org/1999/02/22-rdf-syntax-ns#type',
'http://www.w3.org/2002/07/owl#Class'). |
| Query for instances that belong to one class: |
rdf_has(X,Y,'http://www.figurenontologie.de/
#on27_action_and_behaviour'). |
| Query for top down relation: |
rdf_has(Y,Z,X),rdf_has(A,B,Y). |
| Querying hierarchies: |
rdf_has(SUBCLASS,'http://www.w3.org/2000/01/rdf-schema#subClassOf',SUPERCLASS),
rdf_has(SUPERCLASS,'http://www.w3.org/2000/01/rdf-schema#subClassOf',SUPERSUPER),
rdf_has(SUPERSUPER,'http://www.w3.org/2000/01/rdf-schema#subClassOf',SUPER). |
| Query for sibling classes: |
rdf_has(SUBCLASS1,'http://www.w3.org/2000/01/rdf-schema#subClassOf',SUPERCLASS),
rdf_has(SUBCLASS2,'http://www.w3.org/2000/01/rdf-schema#subClassOf',SUPERCLASS),SUBCLASS1\=SUBCLASS2. |
Table 2.
| An ABox query: |
(RETRIEVE(?X)(?X |http://www.figurenontologie.de/unnamed.owl#
on4_general_notes|)) |
| A TBox query with negation: |
(TBOX-RETRIEVE(?X) (NEG (?X |http://www.figurenontologie.de/ unnamed.owl#on43_statement_about_the_analised_character|))) |
| Query with head projection operator: |
(RETRIEVE(?X(TOLD-VALUE
(|http://www.figurenontologie.de/unnamed.owl# storytitle|?X)))
(?X|http://www.figurenontologie.de/unnamed.owl# on43_statement_about_the_analised_character|)) |
Works Cited
Antoniou & Harmelen 2003 Antoniou, G. and van
Harmelen, F. "Web ontology Language: OWL." In S. Staab and R. Studer (eds), Handbook on Ontologies, Springer, Berlin (2003), pp. 67-92.
Baader & Nutt 2003 Baader, F. and Nutt, W. "Basic Description Logics." In F. Baader, D. Calvanese, D. L. McGuinness, D. Nardi, and P.F. Patel-Schneider (eds), The Description Logic Handbook, Cambridge University Press. Cambridge (2003), pp. 43-95.
Baader et al. 2003 Baader, F., Horrocks, I. and Sattler,
U.: "Description Logics." In Handbook on Ontologies Ed. Steffen Staab and Rudi Studer. Berlin (2003). pp.3-28.
Benjamins et al. 1999 Benjamins, V. R., Fensel, D.,
Decker, S., and Gómez Pérez, A. "(KA)2: building ontologies for the
Internet: a mid-term report." In International Journal of Human-Computer Studies, 51: 687-712.
Berthold 2003 Berthold, M. Chapter 9: Fuzzy Logic. In M.
Berthold and D.J. Hand (eds), Intelligent Data Analysis, Springer, Berlin (2003), pp. 321-350.
Charniak & McDermott 1985 Charniak, E. and McDermott, D. Introduction to Artificial Intelligence. Reading MA (1985).
Cimiano et al. 2006 Cimiano, P., Völker, J., and Studer,
R. "Ontologies on Demand? - A Description of the State-of-the-Art,
Applications, Challenges and Trends for Ontology Learning from Text." In Information, Wissenschaft und Praxis, 57 (6-7): 315-320.
Farrar et al. 2002 Farrar, S., Lewis, W. D., and
Langendoen, D. T. "An Ontology for Linguistic Annotation." In Semantic Web Meets Language Resources: Papers from the AAAI Workshop, Technical Report WS-02-16, Menlo Park, CA 2002, pp. 11–19.
Gomez 2004 Gomez, F. "Grounding the
ontology on the Semantic Interpretation Algorithm." In Proceedings of the Second Global WordNet Conference, Brno, Czech Republic 2004. pp.124–129.
Greimas 1971 Greimas, A. J. Strukturale Semantik: methodologische Untersuchungen. Braunschweig 1971.
Gruber 1993 Gruber, T. R. "A
translation approach to portable ontology specifications." In Knowledge Acquisition, 5(2): 199–221.
Haarslev et al. 2004 Haarslev, V., Möller, R., and Wessel, M. Querying the Semantic Web with Racer + nRQL. Proceedings of the KI-2004 International Workshop on Applications of Description Logics (ADL'04), Ulm, Germany, September 2004.
Halpern 2005 Halpern, J. Y. Reasoning about uncertainty. Cambridge, Mass: MIT Press.
Jannidis 2004 Jannidis, F. Figur und Person - Beitrag zur historischen Narratologie. Walter de Gruyter, Berlin (2004).
Lüngen 2007 Lüngen, H. and Storrer, A. "Domain ontologies and wordnets in OWL: Modelling options." In Zeitschrift für Computerlinguistik und Sprachtechnologie, 22(2): 1–17.
McCarty 2005 McCarty, W. Humanities Computing. Palgrave Macmillan, London (2005).
Meister 1999 Meister, J. C. Jenseits von Lagado. "Literaturwissenschaftliches Programmieren am Beispiel des Kodierungsprogramms Move Parser 3.1." In Deubel, Volker; Eibl, Karl; Jannidis, Fotis (Hgg.): Jahrbuch für Computerphilologie 1. Paderborn 1999, 71-83.
Nardi & Brachman 2003 Nardi, D., Brachman, R. "An Introduction to Description Logics." In Description Logics
Handbook, Cambridge University Press, Cambridge (2003).
Nieragden 1995 Nieragden, G. Figurendarstellung im Roman: Eine narratologische Systematik am Beispiel von David Lodges Changing Places und Ian McEwans The Child in Time. Wissenschaftlicher Verlag Trier, Trier (1995).
Noy 2003 Noy, N. F. "Tools for Mapping
and Merging Ontologies." In Handbook on Ontologies. Ed. Steffen Staab and Rudi Studer. Berlin 2003. pp.365-384.
Noy et al. 1997 Noy, N. F. and Hafner, C.D. The State of the
Art in Ontology Design. In AI Magazine 18 (3) (1997). pp.53-74.
Philosophie-Lexikon 1991 Philosophie-Lexikon. A. Hügli and P. Lübcke (eds), Reinbek bei Hamburg (1991).
Propp 1968 Propp, V. Morphology of the Folktale. London 1968.
Puppe et al. 2000 Puppe, F., Stoyan, H., and Studer, R.
Kapitel 15: Knowledge Engineering. In G. Görz, C.-L.Rollinger, and J. Schneeberger (eds), Handbuch der Künstlichen Intelligenz. Oldenbourg Verlag, München (2000), pp. 599–641.
Renear et al 2006 Renear, A. H., Yunseon, C., Jin Ha, L.,
and Schmidt, S. Axiomatizing FRBR: An Exercise in the Formal Ontology of Cultural Objects.
Proceedings of Digital Humanities 2006. Paris 2006. pp. 164-170.
Shirky 2005 Shirky, C. Ontology is Overrated: Categories, Links, and Tags. O'Reilly ETech conference. 2005.
Sirin et al. 2007 Sirin, E., Parsia, B. Cuenca Grau, B.,
Kalyanpur, A. and Katz, Y. Pellet: A practical OWL-DL reasoner. Journal of Web Semantics, 5(2), 2007. pp. 51-53.
Smith 2007 Smith, D. C. P. Re-Discovering Wittgenstein. In: Proceedings of 30th
International Wittgenstein Symposium, Kirchberg am Wechsel 2007. pp.208-210.
Stevens et al. 2000 Stevens, R, Goble, CA, and Bechhofer, S. Ontology-based knowledge representation for bioinformatics. Brief Bioinform. 2000; 1: 398-414.
Tsarkov & Horrocks 2006 Tsarkov, D. and Horrocks, I. FaCT++ Description Logic Reasoner: System Description. Proceedings of the International Joint Conference on Automated Reasoning (IJCAR 2006) Vol. 4130, Lecture Notes in Artificial Intelligence, Springer, Berlin (2006), pp. 292–297.
Ziegler 2004 Ziegler, C. Surfende Maschinen. In Das XML-Kompendium. Hannover (2004). pp. 126-130.
Zöllner-Weber & Apollon 2008 Zöllner-Weber, A. and Apollon,
D. "The Challenge of Modelling Information and Data in the
Humanities." In Proceedings of the International Conference on Explorations of new Spaces, Relations and Dynamics in Digital Media Ecologies. Innsbruck: Innsbruck University Press 2008.
Zöllner-Weber & Pichler 2007 Zöllner-Weber, A. and Pichler,
A. "Utilizing OWL for Wittgenstein's Tractatus." 30th International Wittgenstein Symposium. Kirchberg am Wechsel 2007. pp.248-250.
Zöllner-Weber & Witt 2006 Zöllner-Weber, A. and Witt, A.
"Ontology for a Formal Description of Literary Characters."
Proceedings of Digital Humanities 2006. Paris 2006. pp. 350-352.
Zöllner-Weber 2006 Zöllner-Weber, A. "Formale Repräsentation und Beschreibung von literarischen Figuren." In G. Braungart, P. Gendolla, and F. Jannidis (eds), Jahrbuch für Computerphilologie 7. mentis Verlag, Paderborn (2006), pp. 187–203.
Zöllner-Weber 2007 Zöllner-Weber, A. "Noctua literaria - A System for a Formal Description of Literary Characters." In G. Rehm, A. Witt, A., and L. Lemnitzer (eds), Data Structures for Linguistic Resources and Applications. Tübingen (2007). pp.113-121.