Harnessing the Universal Geometry of Embeddings
We present the first method to translate text embeddings across different spaces without any paired data or encoders.
Our method, vec2vec, reveals that all encoders, regardless of architecture or training data, learn nearly the same representations!
We demonstrate how to translate between these black-box embeddings with high fidelity, using no paired data.
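How can translation work with no paired data? Below is a minimal sketch in the spirit of unpaired (CycleGAN-style) translation: lightweight adapters map each embedding space into a shared latent space and back, trained with an adversarial loss plus reconstruction and cycle-consistency terms. Everything here (module names, dimensions, loss weights) is an illustrative assumption, not the authors' implementation:

```python
# Illustrative sketch of unpaired embedding translation (NOT the authors'
# code). Adapters map each space into a shared latent space and back;
# training uses adversarial + reconstruction + cycle-consistency losses.
import torch
import torch.nn as nn

DIM_A, DIM_B, LATENT = 768, 1024, 512   # assumed dims for two encoders


def mlp(d_in, d_out):
    """Tiny adapter network; real translators would be deeper."""
    return nn.Sequential(nn.Linear(d_in, LATENT), nn.SiLU(),
                         nn.Linear(LATENT, d_out))


enc_a, dec_a = mlp(DIM_A, LATENT), mlp(LATENT, DIM_A)  # A <-> latent
enc_b, dec_b = mlp(DIM_B, LATENT), mlp(LATENT, DIM_B)  # B <-> latent
disc_b = mlp(DIM_B, 1)  # discriminator: real B vs. translated A->B


def a_to_b(x):
    """Translate an A-space embedding into B-space via the shared latent."""
    return dec_b(enc_a(x))


opt_g = torch.optim.Adam(
    [*enc_a.parameters(), *dec_a.parameters(),
     *enc_b.parameters(), *dec_b.parameters()], lr=1e-4)
opt_d = torch.optim.Adam(disc_b.parameters(), lr=1e-4)
bce = nn.BCEWithLogitsLoss()

for step in range(1_000):
    # Unpaired batches: DIFFERENT texts embedded by encoder A and encoder B.
    # (Stand-in random tensors here; use real embeddings in practice.)
    xa, xb = torch.randn(64, DIM_A), torch.randn(64, DIM_B)

    # Discriminator step: tell real B embeddings from translations.
    fake_b = a_to_b(xa)
    d_loss = (bce(disc_b(xb), torch.ones(64, 1)) +
              bce(disc_b(fake_b.detach()), torch.zeros(64, 1)))
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()

    # Translator step: fool the discriminator while staying invertible.
    fake_b = a_to_b(xa)
    adv = bce(disc_b(fake_b), torch.ones(64, 1))
    rec = (dec_a(enc_a(xa)) - xa).pow(2).mean()      # A -> latent -> A
    cyc = (dec_a(enc_b(fake_b)) - xa).pow(2).mean()  # A -> B -> A
    g_loss = adv + rec + cyc                         # equal weights, illustrative
    opt_g.zero_grad(); g_loss.backward(); opt_g.step()
```

A real system would also train the symmetric B-to-A direction with its own discriminator; both are omitted here for brevity.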
Using vec2vec, we show that vector databases reveal (almost) as much as their inputs.
Given just vectors (e.g., from a compromised vector database), we show that an adversary can extract sensitive information (e.g., PII) about the underlying text.
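To see why leaked vectors are dangerous, here is a toy sketch of zero-shot attribute inference: after translating compromised embeddings into a space whose encoder the adversary controls, each vector is compared by cosine similarity against embeddings of candidate attribute descriptions. The tensors and dimensions below are placeholders, and this is just the simplest version of the idea, not the paper's full attack:

```python
# Toy attribute-inference sketch on translated embeddings (placeholder data).
import torch
import torch.nn.functional as F

# In practice `translated` would come from running the translator on leaked
# vectors, and `label_vecs` from embedding candidate attribute strings
# (e.g., "mentions a medical diagnosis") with the adversary's own encoder.
translated = torch.randn(5, 1024)   # stand-in for translated leaked vectors
label_vecs = torch.randn(3, 1024)   # stand-in for attribute-description vectors

# Cosine similarity of every leaked vector against every candidate attribute.
sims = F.cosine_similarity(translated.unsqueeze(1),
                           label_vecs.unsqueeze(0), dim=-1)   # shape (5, 3)
pred = sims.argmax(dim=1)  # best-matching attribute per leaked vector
print(pred)
```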
Strong Platonic Representation Hypothesis (S-PRH)
We thus strengthen Huh et al.'s PRH to say:
The universal latent structure of text representations can be learned and harnessed to translate representations from one space to another without any paired data or encoders.
https://arxiv.org/abs/2505.12540