Using Semantic Unification to Generate Regular Expressions from Natural Language
Nate Kushman and Regina Barzilay
We consider the problem of translating natural language text queries into
regular expressions which represent their meaning. The mismatch in the level
of abstraction between the natural language representation and the regular
expression representation make this a novel and challenging problem. However,
a given regular expression can be written in many semantically equivalent
forms, and we exploit this flexibility to facilitate translation by finding a
form which more directly corresponds to the natural language. We evaluate our
technique on a set of natural language queries and their associated regular
expressions which we gathered from Amazon Mechanical Turk. Our model
substantially outperforms a state-of-the-art semantic parsing baseline,
yielding a 29% absolute improvement in accuracy.
Back to Papers Accepted