Neura

Benchmark

LongMemEval · Hit@10
Recall accuracy: 79% (+37.5 pts vs baseline)
Baseline recall: 41.5% (embedding-only)
Relative gain: 1.9× (near 2× recall)
Metric: Hit@10 (LongMemEval)
Recall across runs

[Chart: Memory retrieval trend. Series: baseline (41.5), Neura (79).]
Run config

Suite

Probe set: LongMemEval
Metric: Hit@10
Baseline: 41.5% (embedding-only)
Method: BM25 + embeddings + weighted RRF + temporal re-ranking
Measured: 2026-06
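
The fusion step named in the method can be illustrated with a minimal sketch of weighted reciprocal rank fusion (RRF). This is not Neura's implementation: the function name `weighted_rrf`, the retriever names, the weights, and the smoothing constant `k=60` (a commonly used default) are all assumptions chosen for illustration, and the temporal re-ranking stage is not shown.

```python
from collections import defaultdict
from typing import Dict, List


def weighted_rrf(
    rankings: Dict[str, List[str]],
    weights: Dict[str, float],
    k: int = 60,
) -> List[str]:
    """Fuse several ranked lists into one using weighted reciprocal rank fusion.

    rankings: retriever name -> item ids ordered best-first (hypothetical names).
    weights:  retriever name -> weight; missing retrievers default to 1.0.
    k:        smoothing constant; 60 is a common default, assumed here.
    """
    scores: Dict[str, float] = defaultdict(float)
    for retriever, ranked_ids in rankings.items():
        w = weights.get(retriever, 1.0)
        for rank, item_id in enumerate(ranked_ids, start=1):
            # Each list contributes w / (k + rank) to the item's fused score.
            scores[item_id] += w / (k + rank)
    # Highest fused score first; ties broken by id for deterministic output.
    return sorted(scores, key=lambda item_id: (-scores[item_id], item_id))


if __name__ == "__main__":
    rankings = {
        "bm25": ["a", "b", "c"],        # lexical ranking (illustrative data)
        "embeddings": ["b", "c", "a"],  # dense ranking (illustrative data)
    }
    print(weighted_rrf(rankings, {"bm25": 1.0, "embeddings": 1.0}))
    print(weighted_rrf(rankings, {"bm25": 3.0, "embeddings": 1.0}))
```

With equal weights, an item ranked well by both retrievers rises to the top; raising one retriever's weight pulls the fused order toward that retriever's ranking.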

Hit@10 measures how often the correct memory appears among the top 10 recalled items across long, multi-session conversations. Neura reached 79% against 41.5% for the embedding-only baseline, a 1.9× gain that nearly doubles recall.
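
As a rough sketch of how a Hit@10 figure can be computed: each probe question yields a ranked list of recalled items, and the probe counts as a hit if any correct memory is in the top 10. The function names and the data layout below are assumptions for illustration, not the LongMemEval harness itself.

```python
from typing import Iterable, Sequence, Set, Tuple


def hit_at_k(ranked_ids: Sequence[str], relevant_ids: Set[str], k: int = 10) -> bool:
    """Return True if any relevant item appears in the top-k recalled items."""
    return any(item_id in relevant_ids for item_id in ranked_ids[:k])


def hit_rate_at_k(
    results: Iterable[Tuple[Sequence[str], Set[str]]], k: int = 10
) -> float:
    """Fraction of probes with at least one relevant item in the top k.

    results: one (ranked_ids, relevant_ids) pair per probe question
             (hypothetical layout, assumed for this sketch).
    """
    results = list(results)
    if not results:
        raise ValueError("need at least one probe result")
    hits = sum(hit_at_k(ranked, relevant, k) for ranked, relevant in results)
    return hits / len(results)


if __name__ == "__main__":
    probes = [
        (["m3", "m7", "m1"], {"m1"}),  # hit: m1 is within the top 10
        (["m2", "m4"], {"m9"}),        # miss: m9 was never recalled
    ]
    print(hit_rate_at_k(probes, k=10))
```

Under this definition, a score of 79% means that for 79 of every 100 probes the correct memory was somewhere in the first 10 results.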

Measured on LongMemEval (2026-06). Method: BM25 + embeddings + weighted RRF + temporal re-ranking.