Import uses wrong namespace for embedder vectors — search returns no results #51
Problem
When importing a git repo via the Import UI, two different namespaces are used:
- `znzcybercity`: used for the library page, search UI, and disk structure at `~/hero/var/books/znzcybercity/`
- `cybercity`: used for vector storage in hero_embedder

The search page queries the library namespace (`znzcybercity`), which has 0 documents, while the vectors sit in `cybercity` with 2005 documents. Result: "No results found" for every query on a freshly imported library.

Steps to Reproduce
1. Import `https://forge.ourworld.tf/znzcybercity/docs_znzcybercity` via the Import UI
2. Open `/library/znzcybercity` and search anything
3. Check the embedder: the vectors are stored under `cybercity`, not `znzcybercity`

Expected Behavior
The import pipeline should use a single consistent namespace for both the library directory structure and the embedder vector storage.
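
The mismatch can be modelled with a toy in-memory store to show what a post-import consistency check might look like. This is only a sketch: `InMemoryVectorStore` and `check_namespace_consistency` are hypothetical names invented for illustration and are not part of hero_embedder; only the two namespace names and the document count come from this report.

```python
from collections import defaultdict


class InMemoryVectorStore:
    """Toy stand-in for an embedder: documents are grouped by namespace."""

    def __init__(self):
        self._docs = defaultdict(list)

    def upload(self, namespace, docs):
        self._docs[namespace].extend(docs)

    def count(self, namespace):
        # .get() avoids creating an empty entry for unknown namespaces.
        return len(self._docs.get(namespace, []))


def check_namespace_consistency(store, library_namespace):
    """Raise if the library namespace holds no vectors after an import."""
    if store.count(library_namespace) == 0:
        raise RuntimeError(
            f"no vectors under library namespace {library_namespace!r}"
        )


store = InMemoryVectorStore()
# Reproduce the reported state: the vectors land under "cybercity" ...
store.upload("cybercity", [f"doc-{i}" for i in range(2005)])
# ... while the search page queries "znzcybercity".
print(store.count("znzcybercity"), store.count("cybercity"))
```

Calling `check_namespace_consistency(store, "znzcybercity")` on this state raises, which is the kind of failure a regression test for the import pipeline could assert on.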
Solution
Ensure `import_collection_pipeline()` uses the same namespace for:
- `scan_collections()` namespace assignment
- `VectorStoreConfig` namespace for embedder uploads

The library namespace (derived from git org) should be the canonical one used everywhere.
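
One possible shape for the fix is to derive the namespace once and pass that single value to both consumers. The sketch below only illustrates the idea: the function and type names mirror the ones mentioned in this issue, but their signatures, the fields of `VectorStoreConfig`, the helper `namespace_from_git_url`, and the collection name in the usage lines are all assumptions, and the real code may be in a different language.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass(frozen=True)
class VectorStoreConfig:
    """Hypothetical stand-in; the real type's fields are not shown in the issue."""
    namespace: str


def namespace_from_git_url(url: str) -> str:
    """Use the first path segment (the git org) as the namespace."""
    segments = [s for s in urlparse(url).path.split("/") if s]
    if not segments:
        raise ValueError(f"cannot derive a namespace from {url!r}")
    return segments[0]


def scan_collections(namespace: str, names: list[str]) -> list[dict]:
    """Hypothetical stand-in: tag each collection with the given namespace."""
    return [{"namespace": namespace, "name": n} for n in names]


def import_collection_pipeline(url: str, names: list[str]):
    # Derive the namespace exactly once, then reuse it everywhere.
    namespace = namespace_from_git_url(url)
    collections = scan_collections(namespace, names)
    config = VectorStoreConfig(namespace=namespace)
    return collections, config


collections, config = import_collection_pipeline(
    "https://forge.ourworld.tf/znzcybercity/docs_znzcybercity",
    ["docs_znzcybercity"],  # assumed collection name, for illustration only
)
print(config.namespace)
```

Because both the collection records and the vector store config receive the same variable, the two namespaces cannot drift apart inside the pipeline.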