Glossary

    What Is Semantic Faceting?

    Semantic faceting is the first stage of the Green Vectors process. Before the system decides what to store, it identifies the meaning-bearing concepts inside a corpus and groups related semantic signal into facets. Where traditional pipelines treat chunks as the basic unit of retrieval, Green Vectors treats meaning as the basic unit, so redundant fragments, boilerplate, duplicates, and weak signals never become retrieval clutter.

    Why semantic faceting matters

    Most retrieval pipelines store and search at the level of text chunks, which means duplicates, boilerplate, and low-signal fragments all become part of the index and compete with meaningful content. By organizing around meaning-bearing concepts first, semantic faceting keeps clutter out of the index from the start, which improves both efficiency and relevance.

    How it relates to vector reduction

    Semantic faceting is what makes vector reduction possible. By grouping related semantic signal into facets rather than storing every fragment, the system stores far fewer vectors while preserving the meaning that matters.

    FAQ

    Frequently asked questions.

    No. Chunking divides text into pieces. Semantic faceting organizes meaning-bearing concepts into facets, treating meaning rather than text fragments as the basic unit of retrieval.
    By grouping related semantic signal into facets instead of storing every redundant fragment, it keeps duplicates and low-signal content out of the index.

    Related concepts

    See Green Vectors in action

    Kitana is in closed beta. Drop Green Vectors into your existing vector database and benchmark against your own workload.

    Request Kitana beta access