Embedding-compass

๐ŸŒ The Embedding Compass: Cultural Geometry in Multilingual AI

Python License: MIT

Research Question: Do multilingual embedding models encode culturally-specific semantic relationships between abstract moral concepts?

Answer: Yes. The geometric relationships between moral concepts vary significantly across languages, suggesting that embedding models absorb cultural values from training data.


๐Ÿ”ฌ Key Finding

The relationship between justice, mercy, and punishment differs by 21% across languages: This difference is statistically significant (F=48.0, p<0.0001).

Language Ratio* Interpretation
๐Ÿ‡ฏ๐Ÿ‡ต Japanese 1.279 Justice and mercy closely related
๐Ÿ‡ฌ๐Ÿ‡ง English 1.290 Balanced relationship
๐Ÿ‡จ๐Ÿ‡ณ Chinese 1.308 Moderate separation
๐Ÿ‡ธ๐Ÿ‡ฆ Arabic 1.476 Greater conceptual distance
๐Ÿ‡ฎ๐Ÿ‡ณ Hindi 1.549 Justice and mercy are distinct concepts

*Ratio = distance(justiceโ†’mercy) / distance(justiceโ†’punishment)

Statistical Validation:

Methodology

Model: paraphrase-multilingual-mpnet-base-v2 (768-dimensional embeddings)

Languages: English, Hindi, Japanese, Arabic, Chinese

Concepts: 10 abstract moral terms (justice, mercy, duty, honor, forgiveness, punishment, law, freedom, loyalty, sacrifice)

Metric: Cosine distance = 1 - cosine_similarity

Statistical Test: One-way ANOVA with bootstrap resampling (n=20 per language)

Visualization: Interactive Plotly HTML charts

๐Ÿ“Š What This Means

Cultural Insight

AI Implications

Key Charts

Justice-Mercy Ratio Comparison

Ratio Comparison Shows the 21% variation in concept relationships across languages

Hindi Concept Heatmap

Hindi Heatmap Visualizes how moral concepts cluster in Hindi embeddings

Japanese Concept Heatmap

Japanese Heatmap Visualizes how moral concepts cluster in Japanese embeddings

License: MIT Python 3.8+ Open In Colab

๐Ÿš€ Quick Start

Open In Colab

  1. Click the badge above
  2. Click Runtime โ†’ Run all
  3. Results appear in ~5 minutes

Option 2: Run Locally

```bash

Clone repository

git clone https://github.com/SShreeya-Das/Embedding-compass.git cd embedding-compass

Install dependencies

pip install -r requirements.txt

Run analysis

python analysis.py # (if you created this file)

OR open the notebook in Jupyter