Consider the problem of determining the next action of an agent trying to accomplish a goal,
We can factor out a prior by formulating using Bayes Rule:
where the first term is the likelihood and second prior, which can be a pretrained language model if
In our case, we use BabyAI as the environment. The prior is GPT2 fine-tuned on many goals, where
The prior achieves decent accuracy (~70%) and the likelihoods achieve very high accuracy (~93%) with low data (few hundred data points) only on the given goal.
View detailed experiment logs here.