Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite are supported.
| Tags | Database, Database Engines/Servers, Text Processing, Linguistic, General, Indexing, Information Management, education |
|---|---|
| Licenses | GPL |
| Operating Systems | Windows, POSIX, Linux, Solaris, Mac OS X |
| Implementation | C++ |
Recent releases


Changes: A serious bug in the topographic part of the query language that could lead to incomplete or misleading query results was fixed. The Emdros Query Tool was given tree output and KWIC output options, as well as new ways of "harvesting" results. Many other small improvements and bugfixes were made.


Changes: Support for Solaris was reinstated.


Changes: Various speed improvements and bugfixes were made. SQLite3 was upgraded to version 3.4.2. The CREATE DATABASE statement now optionally takes an "encoding" parameter. The Java bindings were improved.


Changes: The topographic part of the language was expanded. A TIGER XML importer was added. Beta quality C# bindings were added. Bugfixes were made, and the regression test suite was enhanced.


Changes: Speed improvements were made to statement creation and database querying. SQLite 3 was upgraded to the latest version. The SQLite 3 schema was changed, so existing SQLite 3 databases must be dumped with mqldump and reimported. Many bugfixes were made.