Home - VincTheSecond/rextractor GitHub Wiki
RExtractor
Welcome to RExtractor wiki pages. RExtractor is an open-source system for detection entities and relations in unstructured texts. RExtractor system is the result of the project INTLIB supported from the Technology Agency of the Czech Republic (grant no. TA02010182).
You are in the right place if you are looking for exact documentation for administrators and programmers who would like to use or contribute on RExtractor system. Feel free to contact me if you have any questions, bugs or feature requests.
You can visit RExtractor DEMO web pages to play with the system. Demonstration presents our very first use case -- extraction of definitions, rights and obligations from Czech law texts.
For more details you can also visit RExtractor project web page at the Institute of Formal and Applied Linguistics. (Link will be available soon.)
Here is the content of RExtractor technical documentation:
-
Overview
- File structure
- RExtractor Architecture
- List of states
-
Server interfaces
-
XML Formats
- Internal XML Format
- Output Description File
- PML Language format
- Database of Entities
- Database of Relations