top of page

Currently, sentence embedding methods can be trained on large corpora such as Wikipedia. However, if the task is specific to a certain topic, results may not be good since the relationships between words in general corpora do not always correspond to those in a specialized corpus. The solution is to bias the text embedding based on a large general corpus by information from the specialized corpus.

bottom of page