-
Notifications
You must be signed in to change notification settings - Fork 132
ECJ Investigating
< JDT Core Programmer Guide | ECJ
For initial triage it is useful to test how different compiler versions
respond to a given source file. This can be done quickly using a shell
and a few shell scripts. The following assumes bash
.
ecj-HEAD.sh
: Run the compiler from class files of an existing
JDT/Core workspace / git clone:
#!/bin/sh
# Adjust path to the root of a git clone of JDT/Core:
JDT_GIT=/home/git/eclipse.jdt.core
# Adjust path to a suitable Java installation:
JAVA_HOME=${JAVA_HOME:-/home/java/jdk-17}
# Project list valid since https://github.com/eclipse-jdt/eclipse.jdt.core/issues/181
ECJ_BINS=${JDT_GIT}/org.eclipse.jdt.core.compiler.batch/bin
PATH=${JAVA_HOME}/bin:${PATH}
java -classpath $ECJ_BINS org.eclipse.jdt.internal.compiler.batch.Main $*
ecj.sh
Run the compiler from one out of a set of stored ecj jar
files:
Assume you have a directory ($BASE
in the script below) holding
different versions of ecj, with file names like ecj-4.16.jar
, then the
following script can be used to pick and use any of the stored versions:
#!/bin/sh
BASE=${HOME}/jdt/ecj
ECJ=${BASE}/ecj-${1}.jar
if [ -f ${ECJ} ]
then
echo "Compiling with $ECJ"
else
echo "Not found: ${ECJ}"
echo "Available:"
cd ${BASE}
ls ecj-*.jar
exit 2
fi
EXTRA="-proc:none"
case ${1} in
4.[4-7]*)
JAVA_HOME=/home/java/jdk1.8.0
PATH=${JAVA_HOME}/bin:${PATH}
;;
4.[0-3].*)
JAVA_HOME=/home/java/jdk1.6.0
PATH=${JAVA_HOME}/bin:${PATH}
;;
3.[0-9]*)
JAVA_HOME=/home/java/jdk1.5.0
PATH=${JAVA_HOME}/bin:${PATH}
EXTRA=""
;;
esac
shift
echo java -jar ${ECJ} ${EXTRA} $*
java -jar ${ECJ} ${EXTRA} $*
Example invocation: $ ecj.sh 4.16 -1.8 Hello.java
The extra argument -proc:none
is useful because some historic ecj
versions lack the annotation processing classes and thus crash when
trying to perform annotation processing.
Also note, that different version ranges of ecj require different Java versions to run. Adjust paths in the script to your local installations.
If the compiler behavior changed significantly at a yet unknown point in time, the following appears to be the fasted approach to find the exact commit causing the change:
- Use the parametric script
ecj.sh
to find the release or even milestone that first introduced the change. - Identify git tags of last version with old behavior and first version with new behavior
- In a shell inside a JDT/Core git clone perform the
git bisect
subcommands - In an Eclipse workspace that has org.eclipse.jdt.core from that git imported, refresh the git repo and wait for auto build to complete
- In another shell where you have your test source invoke
ecj-HEAD.sh
- Check the result and go back to 3. until done
It is useful to have a git clone and eclipse workspace dedicated only to
bisecting, so that you never mess up your regular workspace. This is
reflected by a copy of ecj-HEAD.sh
where ${JDT_GIT}
points to that
instance of org.eclipse.jdt.core.
In most cases, it is sufficient to check which compiler version accepts / rejects the given test program, possibly checking also for warnings.
In other cases it is also necessary to compare executions of the compiled test program.
In rare case it may even be necessary to also check the compiler output files (e.g., to check class file attributes like annotations etc). When different compiler versions generate different byte code, the following sequence would identify that difference:
$ ecj.sh 4.10 Hello.java
$ javap -p -v -c Hello.class > Hello-4.10.list
$ ecj.sh 4.11 Hello.java
$ javap -p -v -c Hello.class > Hello-4.11.list
$ diff Hello-4.10.list Hello-4.11.list
Obviously, a graphical diff tool (like kompare
) comes in handy for
this task.
For high level investigation of some complex machinery in the compiler, console logging may be a good starting point. Variables controlling this can be found here:
-
org.eclipse.jdt.internal.compiler.parser.Parser.DEBUG
and..DEBUG_AUTOMATON
-
org.eclipse.jdt.internal.compiler.parser.diagnose.DiagnoseParser.DEBUG
and..DEBUG_PARSECHECK
org.eclipse.jdt.internal.compiler.lookup.InferenceContext18.DEBUG
Error reporting
- Inside
ProblemHandler.handle(int,String[],int,String[],int,int,int,ReferenceContext,CompilationResult)
- Breakpoints after
case ProblemSeverities.Error
andcase ProblemSeverities.Warning
will be triggered by every detected problem of the given severity (although some may be filtered out duringCompilationUnitDeclaration.finalizeProblems
).
- Breakpoints after
- Breakpoints in constructors of
Problem*Binding
: this bindings are created when resolving found an error, sometimes the creation of the problem binding is closer to the cause then the above handle() method. - To find the locations responsible for a particular known error
message proceed as follows:
- Inside
org/eclipse/jdt/internal/compiler/problem/messages.properties
locate the error message and identify the error number at the start of that text line - Find the same number in
org.eclipse.jdt.core.compiler.IProblem
and search for references of the corresponding constant - You will have found one or more methods in
org.eclipse.jdt.internal.compiler.problem.ProblemReporter, call
hierarchy of its callers are what you are looking for
- Shortcut: call hierarchy on the constant in IProblem (make sure the Field access option from the view menu is not set to Write access).
- Inside
Type lookup from the environment
- Methods
LookupEnvironment.askForType
(both overloads!): every additional type pulled into compilation must be found by one of these methods.
Parsing
- A breakpoint on the first line of method
Parser.consumeRule(int)
approximates single stepping through the parse process.- Note, that Parser & Scanner have useful toString() methods, so the progress of parsing can be watched in the details of the Variables view.
TypeInference
- I'll write a section dedicated only to debugging type inference
Checking compiler output during tests
-
AbstractRegressionTest.runTest()
(the core method, longest parameter list): a breakpoint immediately afterbatchCompiler.compile()
let's you inspect the generated class files, to be found in your tmp directory atcomptest/run
/regression
.
Completion
- An exception breakpoint on
org.eclipse.jdt.internal.codeassist.complete.CompletionNodeFound
will trigger when the completion node is resolved.- Use this breakpoint if the AST looks OK, but problems exist downstream
Other exceptions used in the compiler
Exceptions where throw and catch are near to each other are not listed here.
-
AbortCompilation
and subtypes: thrown when severe problems suggest that nothing good will come afterwards -
CopyFailureException
: to enable multiple resolution attempts, lambda expressions are copied (AST). In case of syntax errors, this may not be possible. -
ClassFormatException
: when a .class file cannot be interpreted. -
InferenceFailureException
: initially used to detect gaps in the implementation of type inference, mostly obsolete by now.
Investigating generated class files
If a compiler regression test fails in the phase of executing the generated class files, you may want to take a closer look at those class files, but they will be gone after the test finished.
- In
AbstractRegressionTest
find thatrunTest
method with the longest parameter list.- within that method set a breakpoint at or near line
nameEnvironment.cleanup();
, at which point the class files have just been created, so you may want to terminate the test and head to a shell / command prompt for further analysis.
- within that method set a breakpoint at or near line
- regression tests are executed below your system's TMP directory, in a
comptest/run.<timestamp>/regression
subdirectory.