Known Issues - sjbutler/jim-core GitHub Wiki

There are a few issues that might be encountered when using jim-core that are understood. Please check through this page, as well as issues and PRs before reporting a problem.

File encoding

jim-core uses ANTLR to generate the lexer and parser used for Java. The ANTLR library classes that process the content of source code files, convert files to UTF-8 before passing the content to the lexer. The majority of the time this is not problematic. However, there have been some occasions where files have been labelled with the incorrect encoding in the file system and errors occur when they are converted to UTF-8. Observed cases have been with UTF-8 files that have been wrongly labelled as ISO-8859-1 or Latin-1 encoded. In these cases vowels with diaeresis (umlauts) and the German eszet character are incorrectly encoded. Before reporting errors, please check that the file encoding is correct.