Cross Document Language Modeling
Cross-Document Language Modeling involves pre-training language models to support multi-document natural language processing tasks. It aims to enhance the model's semantic understanding and generation capabilities across a collection of documents by leveraging cross-document contextual associations, thereby improving the performance of applications such as information retrieval, document summarization, and multi-document question answering.