AF Cooper, MA Lemley, C De Sa, L Duesterwald… - arXiv preprint arXiv …, 2026
Recent work shows that standard greedy-decoding extraction methods for
quantifying memorization in LLMs miss how extraction risk varies across sequences.
Probabilistic extraction, i.e., computing the probability of generating a target suffix given …
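As a rough illustration of the quantity this abstract names (not the paper's own method), the sketch below computes log P(suffix | prefix) under a causal LM by summing per-token log-probabilities of the suffix. It assumes the Hugging Face transformers API; "gpt2" is only a placeholder model.

```python
# Hedged sketch: log-probability of a target suffix given a prefix
# under a causal LM. Assumes Hugging Face transformers; "gpt2" is a
# placeholder, not the model studied in the paper.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def suffix_log_prob(model, tokenizer, prefix: str, suffix: str) -> float:
    """Return log P(suffix | prefix), summed over suffix tokens."""
    prefix_ids = tokenizer(prefix, return_tensors="pt").input_ids
    suffix_ids = tokenizer(suffix, add_special_tokens=False,
                           return_tensors="pt").input_ids
    input_ids = torch.cat([prefix_ids, suffix_ids], dim=1)
    with torch.no_grad():
        logits = model(input_ids).logits          # (1, seq_len, vocab)
    log_probs = torch.log_softmax(logits, dim=-1)
    n_prefix = prefix_ids.shape[1]
    total = 0.0
    for i in range(suffix_ids.shape[1]):
        # Logits at position (n_prefix + i - 1) predict the token at (n_prefix + i).
        token_id = suffix_ids[0, i]
        total += log_probs[0, n_prefix + i - 1, token_id].item()
    return total

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
lp = suffix_log_prob(model, tokenizer, "The quick brown fox", " jumps over the lazy dog")
print(f"log P(suffix | prefix) = {lp:.2f}")
```

Unlike greedy-decoding extraction, which only checks whether the single most likely continuation reproduces the target, this probability captures extraction risk even when the target is a non-dominant but still likely continuation.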
JM Martins, J Jumelet, V Priesemann, L Beinborn - arXiv preprint arXiv:2603.19427, 2026
Why do some languages like Czech permit free word order, while others like English
do not? We address this question by pretraining transformer language models on a
spectrum of synthetic word-order variants of natural languages. We observe that …
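A hedged sketch of one way such synthetic word-order variants could be generated; the paper's actual manipulation is not shown in the snippet, so this is purely illustrative. Shuffling tokens within a fraction p of sentences interpolates between fixed (p=0) and free (p=1) word order.

```python
# Illustrative only (not the paper's pipeline): build a synthetic
# word-order variant by token-shuffling a fraction p of sentences.
import random

def word_order_variant(sentences: list[list[str]], p: float,
                       seed: int = 0) -> list[list[str]]:
    """Shuffle a fraction p of sentences token-wise; leave the rest intact."""
    rng = random.Random(seed)
    out = []
    for tokens in sentences:
        if rng.random() < p:
            tokens = tokens[:]   # copy before shuffling in place
            rng.shuffle(tokens)
        out.append(tokens)
    return out

corpus = [["the", "dog", "chased", "the", "cat"],
          ["birds", "sing", "at", "dawn"]]
print(word_order_variant(corpus, p=1.0))  # fully scrambled variant
```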
S Ouyang, H Wang, G Fang, X Ma, L Lin, X Wang - … of the AAAI Conference on Artificial …, 2026
Hallucination in Large Vision-Language Models (LVLMs) remains a critical
challenge, undermining their reliability in real-world applications. Existing studies
have investigated the causes of hallucination at the modality level and proposed …
H Wang, W Xie, H Jiang, Y Wei, K Jiang, M Cao, C Hao… - Proceedings of the AAAI …, 2026
In recent years, Large Vision-Language Models (LVLMs) have significantly
advanced multimodal tasks. However, their inference requires intensive processing
of numerous visual tokens and incurs substantial computational overhead. Existing …
Y Yu, B Chen, Y Zhang, T Xie, M Jing, L Zuo - … of the AAAI Conference on Artificial …, 2026
Large vision-language models (LVLMs) have demonstrated remarkable capabilities
in understanding multimodal data such as images and text. However, the number of
visual tokens in these models often far exceeds that of textual tokens, resulting in …
Y Zhou, Y Zhang, J Chang, X Gu, Y Wang, K Ding… - Proceedings of the AAAI …, 2026
Despite the rapid progress of Vision Language Models (VLMs), existing benchmarks
still concentrate on coarse-grained object recognition or simple relational reasoning,
leaving the fine-grained and higher-order reasoning abilities of these systems largely …
Z Wang, M Li, H Yin, W Liu, Z Wang - Proceedings of the AAAI Conference on …, 2026
Large Vision-Language Models (LVLMs) enhance performance on vision-language
tasks by integrating visual features from pre-trained vision encoders into large
language models (LLMs). However, the large number of visual tokens introduces …
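The integration step this abstract describes is commonly a learned connector that projects vision-encoder features into the LLM's token-embedding space. The sketch below is a generic, LLaVA-style linear projection with placeholder dimensions, not the paper's specific architecture; it also makes concrete why visual tokens dominate the sequence.

```python
# Hedged sketch of a generic LVLM connector (LLaVA-style linear
# projection). Dimensions and token counts are illustrative placeholders.
import torch
import torch.nn as nn

class VisionToLLMConnector(nn.Module):
    def __init__(self, vision_dim: int = 1024, llm_dim: int = 4096):
        super().__init__()
        self.proj = nn.Linear(vision_dim, llm_dim)

    def forward(self, visual_feats: torch.Tensor,
                text_embeds: torch.Tensor) -> torch.Tensor:
        visual_tokens = self.proj(visual_feats)                 # (B, N_vis, llm_dim)
        # Prepend projected visual tokens to the text-token embeddings.
        return torch.cat([visual_tokens, text_embeds], dim=1)   # (B, N_vis + N_txt, llm_dim)

connector = VisionToLLMConnector()
vis = torch.randn(1, 576, 1024)   # e.g. 576 patch features from a ViT encoder
txt = torch.randn(1, 32, 4096)    # 32 text-token embeddings
print(connector(vis, txt).shape)  # torch.Size([1, 608, 4096]) -- visual tokens dominate
```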
H Liang, Y Shen, Y Deng, S Xu, Z Feng, T Zhang… - arXiv preprint arXiv …, 2026
Achieving human-like spatial intelligence for vision-language models (VLMs)
requires inferring 3D structures from 2D observations, recognizing object properties
and relations in 3D space, and performing high-level spatial reasoning. In this paper …
J Liu, D Fan, C Ji, D Zha, Q Tan - arXiv preprint arXiv:2603.13370, 2026
Vision-Language Models (VLMs) have demonstrated remarkable capabilities in
aligning and understanding multimodal signals, yet their potential to reason over
structured data, where multimodal entities are connected through explicit relational …
I Puri, M Damani, I Shenfeld, M Ghassemi, J Andreas… - arXiv preprint arXiv …, 2026
Given a question, a language model (LM) implicitly encodes a distribution over
possible answers. In practice, post-training procedures for LMs often collapse this
distribution onto a single dominant mode. While this is generally not a problem for …
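A simple way to see the distribution the abstract refers to is to sample the same prompt repeatedly and tabulate the answers; a mode-collapsed post-trained model would concentrate nearly all mass on one answer. The sketch below assumes the Hugging Face transformers API, with "gpt2" as a placeholder, and is not the authors' measurement protocol.

```python
# Hedged sketch: Monte Carlo estimate of a model's answer distribution
# for one prompt. "gpt2" is a placeholder, not the models studied.
from collections import Counter
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompt = "Name a primary color:"
inputs = tokenizer(prompt, return_tensors="pt")
counts = Counter()
for _ in range(100):
    out = model.generate(**inputs, do_sample=True, temperature=1.0,
                         max_new_tokens=5,
                         pad_token_id=tokenizer.eos_token_id)
    # Keep only the newly generated tokens after the prompt.
    answer = tokenizer.decode(out[0, inputs.input_ids.shape[1]:],
                              skip_special_tokens=True)
    counts[answer.strip()] += 1

for answer, n in counts.most_common(5):
    print(f"{n/100:.2f}  {answer!r}")
```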
This message was sent by Google Scholar because you're following new articles related to research by Anthony (Tony) G Cohn.