(Out-of-date) Grammar Summary for ERG from 2006
See http://moin.delph-in.net/GrammarCatalogue for current information.
Number of lexical leaf types | 864
Total number of lexical types | 1822
Number of lexical rules | 26
Number of syntactic rules | 153
Total number of types (no GLBs) | 4303
Lexical entries: Hand-built | 24191
Lexical entries: External source | 0
Lines of TDL (excl lexicon) | 28610
Lines of comments | 5757
External morphology | No
Preprocessor | Yes: finite state in LKB
Lexical database | Yes
Unknown word mechanism | Yes: TNT POS-based in PET
Idioms | Yes
Test suites | DELPHINROOT/lingo/lkb/src/tsdb/skeletons
- name:domain (items) | csli:phenom (1348), mrs:semantics (107), hike:tourism (330), rondane:tourism (1424), logon:tourism (9411), vm:meetings (12393), ec:ecommerce (5867), trec9:q-a (693)
Treebanks | http://www.delph-in.net/redwoods and DELPHINROOT/lingo/erg/gold/
Parse-ranking model | Yes, from LOGON treebanks
Generation (trigger rules) | Yes
Realization-ranking model | Yes, from LOGON treebanks
Paraphrasing rules | Yes
SEM-I | Yes
Application(s) | MT, email response
Processing engines | LKB, PET, ACE, (LILFES)
Operating systems | Linux, Windows, MacOS
Notes |
|
- Lines of TDL (excl lexicon) and lines of comments
  mkdir -p /tmp/counttdl
  cd <grammardir>
  cp *.tdl /tmp/counttdl
  cd /tmp/counttdl
  rm lexicon.tdl
  cat *.tdl > total
  wc -l total                   # lines of TDL
  grep -e '^;' *.tdl | wc -l    # lines of comments (lines starting with ';')
- Number of lexical leaf types
  cd /tmp/counttdl
  grep "_le :=" *.tdl | wc -l
- Total number of lexical types
  cd /tmp/counttdl
  cat <lextypefiles> > ltypes
  grep ":=" ltypes | wc -l
- Lexical entries - Hand-built
  cd <grammardir>
  grep ":=" <lexfiles> | wc -l
- Number of syntactic rules
  cd <grammardir>
  grep ":=" <rules> | wc -l
- Total number of types (no GLBs)
  cd /tmp/counttdl
  grep ":=" total | wc -l       # definitions written with ':='
  grep ":<" total | wc -l       # definitions written with ':<'
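
The counting steps in these notes can also be wrapped in a few shell functions that read the grammar directory directly, without the temporary copy in /tmp/counttdl. This is only a sketch under stated assumptions: the function names are hypothetical, lexicon.tdl is taken to be the only file to exclude, and the type total is taken to be the number of lines containing either ':=' or ':<'.

```shell
#!/bin/sh
# Sketch only: hypothetical helper functions, not part of any DELPH-IN tool.
# Each function takes a grammar directory as its single argument.

# Concatenate every *.tdl file in the directory except lexicon.tdl.
tdl_files_cat() {
    for f in "$1"/*.tdl; do
        [ -f "$f" ] || continue                          # no *.tdl files at all
        case "$f" in */lexicon.tdl) continue ;; esac     # skip the lexicon
        cat "$f"
    done
}

# Lines of TDL (excl lexicon).
count_tdl_lines() {
    tdl_files_cat "$1" | wc -l | tr -d ' '
}

# Lines of comments: lines that start with ';'.
count_comment_lines() {
    tdl_files_cat "$1" | grep -c '^;' || true
}

# Number of lexical leaf types: definitions whose name ends in _le.
count_leaf_types() {
    tdl_files_cat "$1" | grep -c '_le :=' || true
}

# Total number of types. Assumption: one type per line containing
# either ':=' or ':<'.
count_types() {
    tdl_files_cat "$1" | grep -c -e ':=' -e ':<' || true
}
```

After sourcing the functions, a call such as count_tdl_lines <grammardir> prints a single number. The "|| true" keeps a zero count from being reported as a failure, since grep exits with status 1 when nothing matches.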