Towards Explainable Semantic Text Matching

Last modified Oct 22, 2018

word embeddings tenancy law legal informatics word2vec tfidf fasttext semantic text matching inproceedings text similarity measure explainable ai (xai) explainable semantic text matching publication

Abstract

The growing amount of textual data in the legal domain leads to a demand for better text analysis tools adapted to legal domain specific use cases. Semantic Text Matching (STM) is the general problem of linking text fragments of one or more document types. The STM problem is present in many legal document analysis tasks, such as argumentation mining. A common solution approach to the STM problem is to use text similarity measures to identify matching text fragments. In this paper, we recapitulate the STM problem and a use case in German tenancy law, where we match tenancy contract clauses and legal comment chapters. We propose an approach similar to local interpretable model-agnostic explanations (LIME) to better understand the behavior of text similarity measures like TFIDF and word embeddings. We call this approach eXplainable Semantic Text Matching (XSTM).

Incoming references

Conference (0)

Publication(s)

Files and Subpages

Name	Type	Size	Last Modification	Last Editor
La18c.pdf	File	261 KB	22.10.2018