Indexing

Not OnPrem

The aiXplain indexing service allows you to store, search, filter, and retrieve your private collections of documents with vector-based semantic search. It is the foundation for building AI search, RAG (retrieval-augmented generation), and domain-specific knowledge systems.

The default indexing and retrieval model on aiXplain is aiR by aiXplain. Your data is securely stored in aiXplain's managed vector database, powered by Qdrant, and is accessible only through your API key. The system supports filtering and ranked results. It also supports multiple types of embeddings for both text and image data.

How it works

You create an index to store your documents.
You can upsert (insert or update) documents into your index.
You can search documents using natural language queries.
You can apply filters based on metadata fields.

Lifecycle

create() → upsert() → count() → search() → get_document() → delete_document() → delete()

note

Use IndexFactory.list() to list all your accessible indexes, based your API key, and IndexFactory.get(<index_id>) to retrieve a specific index.

Creating an index

Creating an Index asset initializes an empty vector database. The private Index will appear in your dashboard and can be accessed at: https://platform.aixplain.com/discover/model/<index.id> You can use this asset to index additional documents using upsert(), or retrieve results using search().

from aixplain.factories import IndexFactory

index = IndexFactory.create(
    name="My First Index",
    description="An index storing random scientific facts",
)

index.id

Parameters:

name (str): Index name.
description (str): Short description.
embedding_model (EmbeddingModel, optional): Selects the model used to embed your data into vector space (default: OPENAI_ADA002).

note

Assets' name and description help agents reason about tool usage and are displayed on asset cards in Discover.

List Embedding Models

List all available text embedding models on aiXplain by filtering models based on their function (TEXT_EMBEDDING). Use the model ID to call the embedding model.

from aixplain.factories import ModelFactory
from aixplain.enums import Function


model_list = ModelFactory.list(function=Function.TEXT_EMBEDDING, page_size=50)["results"]

for model in model_list:
  print(model.id, model.name, model.supplier)

Show output

Example usage: This example uses the Text Embedding Batch model by OpenAI

from aixplain.factories import IndexFactory
from aixplain.enums import EmbeddingModel

index = IndexFactory.create(
    name="Long Technical Documents",
    description="Index optimized for in-depth technical papers and reports",
    embedding_model="6734c55df127847059324d9e"
)

print(index.id)

Index Configuration Options

By default, aiXplain creates an index using the aiR embedding model. This default behavior can be overridden by explicitly specifying indexing parameters using one of the following parameter classes:

Each class allows you to define paramters - name and a description giving you fine-grained control over how your data is embedded and queried.

Parameter Class	Use Case
`AirParams`	Default vector search with aiR embeddings.
`GraphRAGParams`	For graph-enhanced RAG using LLM + embeddings.
`VectaraParams`	Retrieval using Vectara’s optimized search engine.
`ZeroEntropyParams`	High-precision zero-shot indexing and retrieval.

Inserting documents (upsert)

You can insert documents into your index using the Record class.

from aixplain.modules.model.record import Record

records = [
    Record(id="doc1", value="The Earth orbits the Sun once every 365.25 days, creating the calendar year.", attributes={"category": "science"}),
    Record(id="doc2", value="Quantum computing promises to solve complex problems that are currently intractable for classical computers.", attributes={"category": "technology"}),
]

# Upsert documents
index.upsert(records)

Parameters:

id (optional): Document ID. Auto-generated if not provided. Must be unique per document.
value (str): Text content (or extracted content from files).
value_type: Defines the type of the value contained in the Record: DataType.TEXT or DataType.IMAGE (default: text).
attributes (dict): Optional metadata (e.g., author, category, date) for filtering.
uri (optional): If uploading files (e.g., images), you can use uri instead of value, and the file will be processed.

Notes:

When value_type is set to DataType.TEXT, the value field should contain a text string.
When value_type is set to DataType.IMAGE, the uri field must be provided, pointing to an image file or URL.
If a file path or URL is provided for an image and it is valid, the Record automatically adjusts the value_type to DataType.IMAGE.

info

Use the validate method to check if a Record has the correct data type

Searching an index

note

A new index can take up to 5 minutes to return its first results; afterward, searches will be faster.
Because indexes experience cold starts, they may need time to warm up. Upgrade to a paid instance to keep them active 24/7.

Perform a semantic search using natural language queries:

response = index.search("What is AI?")

Parameters:

query (str): Natural language query.
top_k (int, optional): Number of rankedresults (default = 10).
filters (list, optional): List of IndexFilter to filter based on metadata.

response.details # List of top results with scores and metadata

response.data # Top ranking data

response # List of top results with scores and metadata

Show output

ModelResponse fields:

status: The final status of the request (e.g., SUCCESS, FAILURE).
data: The top-ranked retrieved result or output, typically the most relevant match.
details
A list of matched documents with associated information:
- score: Relevance score indicating how closely the document matches the query.
- data: Text snippet or content retrieved from the document.
- document: ID of the document where the match was found.
- metadata: Additional metadata about the document (e.g., category).
completed: Boolean indicating whether the operation completed successfully (true or false).
used_credits: The amount of credits consumed by the operation.
run_time: The total time (in seconds) it took to complete the operation.

Filtering an index

You can also filter your search results by metadata using the IndexFilter class:

from aixplain.modules.model.index_model import IndexFilter, IndexFilterOperator

filters = [
    IndexFilter(field="category", value="technology", operator=IndexFilterOperator.EQUALS),
]

response = index.search(
    query="Artificial Intelligence",
    filters=filters
)

response.details

Available filter operators:

Operator	Meaning
EQUALS (`==`)	Field equals value
NOT_EQUALS (`!=`)	Field does not equal value
CONTAINS (`in`)	Field contains value
NOT_CONTAINS (`not in`)	Field does not contain value
GREATER_THAN (`>`)	Field greater than value
LESS_THAN (`<`)	Field less than value
GREATER_THAN_OR_EQUALS (`>=`)	Field greater than or equal
LESS_THAN_OR_EQUALS (`<=`)	Field less than or equal

note

Filtering by metadata is supported in the IndexModel. Agent-based filters are coming soon.

Retrieving Information

Get Index

from aixplain.factories import IndexFactory

index= IndexFactory.get("<index_id>")

Get document count

index.count()

Show output

Get a specific record by ID

index.get_record("<document_id>").details

Show output

Delete a record by ID

index.delete_record("<document_id>").details

Update record

To update a record, get it's ID and create a new record with the same ID. Then use the upsert method to update the index with the new record.

records = [
    Record(id="doc1", value = "The Earth orbits the Sun,creating a calender year."),
]

index.upsert(records)

Index usage with agents

Indexes can be used directly as tools inside aiXplain agents for retrieval-augmented generation (RAG).

from aixplain.factories import AgentFactory

agent = AgentFactory.create(
    name="Knowledge Assistant",
    description="Answers based on the provided index",
    tools=[
        AgentFactory.create_model_tool(index.id)
    ]
)

agent_response = agent.run("What is quantum computing promising?")

agent_response

Show output

The agent will now search the index automatically when answering questions.

Learn more about agent tools →

Full example

from aixplain.factories import IndexFactory
from aixplain.modules.model.record import Record
from aixplain.modules.model.index_model import IndexFilter, IndexFilterOperator

# Create
index = IndexFactory.create(name="Knowledge Base", description="General knowledge across technology, art, and science.")

# Prepare data
data = [
    {
        "id": "doc1",
        "text": "Artificial intelligence is transforming industries worldwide, from healthcare to finance.",
        "category": "technology"
    },
    {
        "id": "doc2",
        "text": "The Mona Lisa, painted by Leonardo da Vinci, is one of the most famous artworks in history.",
        "category": "art"
    },
    {
        "id": "doc3",
        "text": "Machine learning algorithms are being used to predict patient outcomes in hospitals.",
        "category": "technology"
    },
    {
        "id": "doc4",
        "text": "The Earth orbits the Sun once every 365.25 days, creating the calendar year.",
        "category": "science"
    },
    {
        "id": "doc5",
        "text": "Quantum computing promises to solve complex problems that are currently intractable for classical computers.",
        "category": "technology"
    },
]

# Upsert
records = []
for item in data:
    record = Record(
        id=item["id"],
        value=item["text"],
        attributes={"category": item["category"]}
    )
    records.append(record)

index.upsert(records)

# Search
response = index.search("artificial intelligence")

# Filtered search
filters = [IndexFilter(field="category", value="technology", operator=IndexFilterOperator.EQUALS)]
filtered_response = index.search("computing", filters=filters)

filtered_response

Show output

How it works​

Lifecycle​

Creating an index​

List Embedding Models​

Index Configuration Options​

Inserting documents (upsert)​

Searching an index​

Filtering an index​

Retrieving Information​

Get Index​

Get document count​

Get a specific record by ID​

Delete a record by ID​

Update record​

Index usage with agents​

Full example​

How it works

Lifecycle

Creating an index

List Embedding Models

Index Configuration Options

Inserting documents (upsert)

Searching an index

Filtering an index

Retrieving Information

Get Index

Get document count

Get a specific record by ID

Delete a record by ID

Update record

Index usage with agents

Full example