The Donatus system provides a unified frontend to a variety of morphological analysis software and databases. Morphological services are provided both through a Remote Procedure Call (RPC) interface, which can be used by specialized user applications, and through a CGI interface, which is accessible from any web browser. Morphological data can be represented in XML, allowing them to be cached on client systems and processed by a wide range of software. In addition to providing access to pre-existing linguistic data, Donatus allows users to extend morphological datasets dynamically.
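As a rough illustration of how XML-encoded morphological data might be parsed and cached on a client, consider the following minimal sketch. It rests on several assumptions: the XML layout (`analyses`, `analysis`, and the attributes `form`, `lemma`, `pos`, `features`) is hypothetical and is not the actual Donatus schema, and the names `parse_analyses`, `MorphologyCache`, and `fake_fetch` are invented for this example. The function `fake_fetch` merely stands in for a real RPC or CGI request.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML response. The element and attribute names are
# assumptions made for illustration, not the actual Donatus schema.
SAMPLE_XML = """\
<analyses form="amavit">
  <analysis lemma="amo" pos="verb" features="3rd sg perf ind act"/>
</analyses>
"""


def parse_analyses(xml_text):
    """Return (form, list of analysis dicts) from an XML response."""
    root = ET.fromstring(xml_text)
    form = root.get("form")
    analyses = [dict(node.attrib) for node in root.findall("analysis")]
    return form, analyses


class MorphologyCache:
    """Client-side cache keyed by word form (hypothetical helper)."""

    def __init__(self, fetch):
        self._fetch = fetch   # callable: word form -> XML string
        self._store = {}      # word form -> parsed analyses
        self.misses = 0       # number of times the service was queried

    def lookup(self, form):
        # Query the service only if the form has not been seen before.
        if form not in self._store:
            self.misses += 1
            _, analyses = parse_analyses(self._fetch(form))
            self._store[form] = analyses
        return self._store[form]


def fake_fetch(form):
    # Stand-in for an RPC or CGI request; always returns the sample.
    return SAMPLE_XML
```

A client would construct `MorphologyCache(fake_fetch)` and call `lookup("amavit")` repeatedly; only the first call reaches the (simulated) service, while later calls are answered from the local store.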
Donatus is intended to be one component in an evolving framework for network-accessible linguistic services. These services may be described as linguistic middleware: the idea is to provide a simple but powerful interface layer that allows linguistic tools and data sources to be employed in end-user applications such as browsers and editors, which we refer to generally as user agents. Other services in this framework will provide segmentation or tokenization of natural language data, as well as orthographic normalization.