Tokenizer Choice For LLM Training: Negligible or Crucial? Paper • 2310.08754 • Published Oct 12, 2023 • 3
Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings Paper • 2202.06671 • Published Feb 14, 2022 • 2
Specialized Document Embeddings for Aspect-based Similarity of Research Papers Paper • 2203.14541 • Published Mar 28, 2022
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning Paper • 2301.09626 • Published Jan 23, 2023 • 2
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 49
deutsche-telekom/gbert-large-paraphrase-euclidean Sentence Similarity • Updated Aug 24, 2024 • 6.84k • 12
deutsche-telekom/gbert-large-paraphrase-cosine Sentence Similarity • Updated Aug 24, 2024 • 10.1k • 27
deutsche-telekom/Llama-3.1-MoE-8x8B-Instruct-raw Text Generation • 47B • Updated Aug 23, 2024 • 5 • 1
CarExpert: Leveraging Large Language Models for In-Car Conversational Question Answering Paper • 2310.09536 • Published Oct 14, 2023
DialoKG: Knowledge-Structure Aware Task-Oriented Dialogue Generation Paper • 2204.09149 • Published Apr 19, 2022 • 1
Grounding Dialogue Systems via Knowledge Graph Aware Decoding with Pre-trained Transformers Paper • 2103.16289 • Published Mar 30, 2021