Continuous Vector In-Context Learning
🔗 Source: arXiv
VECTOR-ICL: IN-CONTEXT LEARNING WITH CONTINUOUS VECTOR REPRESENTATIONS
🚀 Technical Novelty
- Mechanism: A lightweight, trainable projector maps arbitrary encoder embeddings into the LLM’s native text embedding space, allowing continuous vectors to function as direct in-context demonstrations.
- Nuance: Unlike standard ICL that relies on discrete tokenization or natural language prompts, Vector-ICL bypasses lexical constraints entirely, enabling seamless cross-modal few-shot adaptation without modifying the base LLM weights.
💡 Yield
- Achieves competitive or superior few-shot performance across text classification, summarization, numerical function regression, time-series, graph classification, and fMRI decoding compared to standard ICL and domain-specific tuned models.
- Demonstrates that pretraining projectors on general language modeling objectives is sufficient to unlock Vector-ICL, while task-specific fine-tuning further amplifies accuracy without retraining the decoder.
⚠️ Limitations
- Strictly dependent on external black-box encoders to generate initial embeddings; performance is bounded by the quality and alignment of those source representations.
- Cross-modal tasks often require non-linear projectors rather than simple linear mappings, increasing architectural complexity and training overhead for heterogeneous data types.