GrammarsIntroduction - skilchen/bots GitHub Wiki

Introduction

Definition: A grammar is a description of an edi-message.
A grammar describes the records, fields, formats etc of an edi file.
Bots uses grammars to parse, check and generate edi files.
Grammars files are independent of the editype: a grammar for a csv files looks the same as a grammar for a x12 file.
Grammar files are in: usersys/grammars/editype/grammar name.py

Learn grammar by example

Best way to get the idea of a grammar is to look at the (simple) example in the chapter.

This CSV file:

HEADER,ordernumber1,buyer1,20120524
LINE,1,article1,24,description1
LINE,2,article2,288,
LINE,3.article3,6,description3

has this grammar:

from bots.botsconfig import *        #always needed

syntax = {                           #'syntax' section
'field_sep' : ',',               #specify field separator
'charset'   : 'utf-8',           #specify character-set
}

structure = [                        #'structure' section
{ID:'HEADER',MIN:1,MAX:999,LEVEL:[   #each order starts with a HEADER record
{ID:'LINE',MIN:1,MAX:9999},      #nested under the HEADER record are the LINE records, repeat up to 9999 times
]}
]

recorddefs = {                       #'recorddefs' section
'HEADER':[                           #specify the fields in the HEADER record
['BOTSID','M',6,'A'],            #BOTSID is for the HEADER tag itself
['order number', 'M', 17, 'AN'], #for each field specify the format, max length, M(andatory) or C(onditional)
['buyer ID', 'M', 13, 'AN'],
['delivery date', 'C', 8, 'AN'],
],
'LINE':[
['BOTSID','M',6,'A'],
['line number', 'C', 6, 'N'],
['article number', 'M', 13, 'AN'],
['quantity', 'M', 20, 'R'],
['description', 'C', 70, 'R'],
],
}

The example above is simple, but fully functional.

Sections of a grammar

A grammar file consists of these sections:

syntax: parameters for the grammar like field separator, merge or not, indent xml, etc.
structure: sequence of records in an edi-message: start-record, nested records, repeats.
recorddefs: fields per record.
nextmessage: to split up an edi file to separate messages.
nextmessageblock: to split up a cvs-file to messages.

A section can be reused/imported from another grammar file.
Purpose: better maintenance of grammars.
Example: edifact messages from a certain directory use the same recorddefs/segments:

 from recordsD96AUN import recorddefs

One edifact grammar consists of four parts. Example:

edifact.py (contains syntax common to all edifact grammars)
envelope.py (contains envelope structure and recorddefs common to all edifact grammars)
recordsD96AUN.py (contains recorddefs common to all edifact D96A grammars)
ORDERSD96AUN.py (contains structure specifically for ORDERS D96A)

Problems for some edifact grammars on sourceforge site

Sometimes you might meet this error for a grammar:

GrammarError: Grammar "...somewhere...", in structure: nesting collision detected at record "etc etc". This is the case eg with INVRPT D96A.
UN says about this that you have to make additional choices in message; either you make some segments mandatory or leave out some segment groups.
EANCOM did make such choices in their implementation guidelines.
So: you can not use the grammar directly, edit it according to your needs. This is according to what UN-edifact wants...