How do I search Mem0 memories using AI agents?

Use the mem_0_memories_search tool provided by Truto. The AI agent must construct a payload that nests entity IDs (like user_id) inside a 'filters' object to successfully query the hybrid retrieval engine.

How do I update an existing memory in Mem0 using an LLM?

Because the Mem0 ingestion pipeline is additive, the agent must first use the Search tool to find the specific memory ID, and then call the update_a_mem_0_memory_by_id tool to mutate the record without creating duplicates.

Can an AI agent delete multiple memories at once?

Yes. Agents can use the mem_0_memories_bulk_delete_by_filter tool to wipe all records associated with specific nested filter criteria, making it highly effective for automating GDPR DSAR requests.

Connect Mem0 to AI Agents: Bulk Edit, Export, and Control User Data

You want to connect Mem0 to an AI agent so your system can autonomously manage persistent user memory, enforce data privacy via bulk deletions, and trigger structured memory exports based on historical context. Here is exactly how to do it using Truto's /tools endpoint and SDK, bypassing the need to hardcode API wrappers or manage complex agent state manually.

If your team uses ChatGPT, check out our guide on connecting Mem0 to ChatGPT, or if you are building on Anthropic's models, read our guide on connecting Mem0 to Claude. For developers building custom autonomous workflows, you need a programmatic way to fetch these tools and bind them to your agent framework.

This guide breaks down exactly how to fetch AI-ready tools for Mem0, bind them natively to an LLM using LangChain (or any framework like LangGraph, CrewAI, or Vercel AI SDK), and execute complex persistent memory workflows. For a deeper look at the architecture behind this approach, refer to our research on architecting AI agents and the SaaS integration bottleneck.

The Engineering Reality of Custom Mem0 Connectors

Building AI agents is easy. Connecting them to external state management and memory APIs is hard. Giving an LLM access to external data sounds simple in a prototype. You write a Node.js function that makes a fetch request and wrap it in an @tool decorator. While this covers the basics of LLM function calling, in production, this approach collapses entirely.

If you decide to build a custom integration for Mem0, you own the entire API lifecycle. Mem0's API introduces several specific integration challenges that break standard LLM assumptions.

The Entity Filter Nesting Trap

When an LLM attempts to search for a specific user's memories, its default behavior is to construct a flat JSON payload. It will attempt to send {"user_id": "user_123", "query": "preferences"}. The Mem0 API explicitly rejects this. Entity IDs (such as user_id, agent_id, app_id, and run_id) cannot be passed at the top level. They must be nested strictly inside a filters body object, and often require logical operators.

If you hand-code these tools, you will spend days writing elaborate system prompts trying to force the LLM to format the payload correctly. Truto solves this by generating a strict, heavily annotated JSON schema via the /tools endpoint that acts as a guardrail, forcing the LLM to output the nested filter structure natively.

The V3 Additive Pipeline vs. Mutative Updates

Mem0 operates on a V3 additive pipeline for memory ingestion. When you hit the memory creation endpoints, the system is async and append-only. Nothing is overwritten. If an agent learns that a user changed their dietary preference from "Vegetarian" to "Vegan" and simply calls the create endpoint again, you now have conflicting memories stored in the vector database.

To update state correctly, the agent must execute a multi-step sequence: search for existing memories using hybrid retrieval, extract the specific memory id, and explicitly call the update or delete endpoints. Your tool definitions must clearly delineate these operational boundaries, or your agent will pollute the memory store.

Async Events and Rate Limit Pass-Through

Mem0 handles heavy operations - like structured exports or large memory bulk operations - asynchronously. Submitting an export job does not return the data; it returns an event_id. The agent must know to poll the event status.

Furthermore, when querying at scale, you will hit rate limits. It is critical to understand the factual reality of how Truto handles this: Truto does not retry, throttle, or apply backoff on rate limit errors. When the Mem0 API returns an HTTP 429, Truto passes that exact error directly to the caller. Truto normalizes the upstream rate limit information into standardized headers (ratelimit-limit, ratelimit-remaining, ratelimit-reset) per the IETF specification. The caller (your agent framework) is entirely responsible for implementing retry and backoff logic. Do not expect an infrastructure layer to absorb rate limits silently, as that masks critical system feedback from the LLM.

Fetching Mem0 AI Agent Tools via Truto

Every integration on Truto maps underlying product APIs into a REST-based CRUD abstraction called Proxy APIs. These Proxy APIs handle pagination, authentication, and query parameter processing, returning data in a predefined format.

Truto provides all the resources defined on an integration as tools for your LLM frameworks. If you have explored building MCP servers for AI agents, you will recognize this pattern of exposing standardized toolsets via a single endpoint. By making a single GET request to https://api.truto.one/integrated-account/<id>/tools, you retrieve a comprehensive list of Proxy APIs formatted specifically as LLM tools, complete with JSON Schema definitions and descriptions.

This removes the need to write custom OpenAPI specs or Pydantic models for Mem0. You authenticate the tenant once, get the integrated account ID, and immediately pull down the required toolkit.

Mem0 AI Agents Integration: Hero Tools

Instead of overwhelming your LLM context window with 50 different endpoints, you should filter your /tools request to provide only the most highly leveraged operations. Here are the core hero tools required for autonomous Mem0 management.

Create a Mem0 Memory

This tool allows the agent to ingest new contextual data into the V3 additive pipeline. Because it is append-only, agents should use this strictly for new observations or conversation logs.

Usage notes: Requires a messages array and at least one entity identifier (e.g., user_id or agent_id). Returns an event_id for tracking the async processing.

"The user mentioned they are moving to New York next month. Store this in Mem0 for user_id 'u_8829' so we can reference it in future real estate recommendations."

Mem0 Memories Search

Standard list operations are insufficient for RAG workloads. This tool triggers Mem0's hybrid retrieval engine, combining semantic search, BM25 keyword matching, and entity matching to return relevance-scored results.

Usage notes: The LLM must construct a query string and provide a nested filters object containing the target entity ID.

"Search Mem0 for any historical preferences regarding deployment environments for org_id 'org_551'. I need to know if they prefer AWS or GCP before I generate this Terraform script."

Update a Mem0 Memory by ID

When a user's state changes, appending a new memory creates conflicts. This tool allows the agent to overwrite specific memory content or metadata directly.

Usage notes: The agent must first use the Search tool to retrieve the target memory id, then pass that id along with the updated payload to this tool.

"The user just clarified that their budget has increased from $50k to $75k. Find the existing memory about their budget and update the content to reflect the new $75k limit."

Mem0 Memories Bulk Delete by Filter

Critical for GDPR compliance and Data Subject Access Requests (DSARs). This tool allows an agent to instantly wipe all persistent memory associated with specific criteria across the entire data store.

Usage notes: Requires careful construction of the nested filter object. Returns an empty 204 response on success.

"User 'u_9921' has requested account deletion under GDPR. Execute a bulk delete filter across the Mem0 storage to wipe all memory records tied to this user_id immediately."

Create a Mem0 Export

Allows the agent to trigger a structured export job, distilling messy conversation histories into a clean, custom Pydantic schema for external BI processing or reporting.

Usage notes: This triggers an async job. The agent must capture the returned export_id and use the Get Export tool later to retrieve the actual data payload.

"Generate a structured memory export for all interactions tied to run_id 'r_001' over the last 30 days. Format the output schema to extract 'primary_complaints' and 'feature_requests'."

To view the complete inventory of available Mem0 tools, required parameters, and detailed schema definitions, visit the Mem0 integration page.

Building Multi-Step Workflows

To build a resilient AI agent, you must orchestrate the interaction between the LLM and the tools. Using LangChain and Truto's SDK, we can dynamically bind the Mem0 tools to the model and establish an execution loop.

Crucially, this loop must handle the HTTP 429 rate limit errors that Truto passes through. The agent framework must read the ratelimit-reset header and pause execution accordingly.

sequenceDiagram
    participant LLM as Agent LLM (LangChain)
    participant Truto as Truto Tools API
    participant Mem0 as Mem0 Upstream
    
    LLM->>Truto: GET /integrated-account/{id}/tools
    Truto-->>LLM: Returns JSON Schema for Mem0 tools
    Note over LLM: Agent binds tools via .bindTools()
    
    LLM->>Truto: Execute mem_0_memories_search
    Truto->>Mem0: Proxy hybrid search request
    Mem0-->>Truto: 200 OK (Memory results)
    Truto-->>LLM: Return memory context
    
    LLM->>Truto: Execute update_a_mem_0_memory_by_id
    Truto->>Mem0: Proxy update request
    Mem0-->>Truto: 429 Too Many Requests
    Truto-->>LLM: HTTP 429 + ratelimit-reset header
    
    Note over LLM: Agent reads header, pauses execution
    LLM->>Truto: Retry update_a_mem_0_memory_by_id
    Truto->>Mem0: Proxy update request
    Mem0-->>Truto: 200 OK
    Truto-->>LLM: Success confirmation

Here is how you implement this framework-agnostic setup using TypeScript and LangChain:

import { ChatOpenAI } from "@langchain/openai";
import { AgentExecutor, createToolCallingAgent } from "langchain/agents";
import { ChatPromptTemplate } from "@langchain/core/prompts";
import { TrutoToolManager } from "truto-langchainjs-toolset";
 
async function executeMem0AgentWorkflow(prompt: string, accountId: string) {
  // 1. Initialize the LLM
  const llm = new ChatOpenAI({ modelName: "gpt-4-turbo", temperature: 0 });
 
  // 2. Initialize Truto Tool Manager for the specific tenant
  const toolManager = new TrutoToolManager({
    apiKey: process.env.TRUTO_API_KEY,
    accountId: accountId,
  });
 
  // 3. Fetch Mem0 tools from Truto's /tools endpoint
  const tools = await toolManager.getTools();
 
  // 4. Define the system prompt instructing the agent on API quirks
  const promptTemplate = ChatPromptTemplate.fromMessages([
    ["system", "You are an autonomous memory management agent. When querying Mem0, ALWAYS nest entity identifiers (like user_id) inside a 'filters' object. Never pass them at the top level. If you receive a rate limit error, inform the user."],
    ["human", "{input}"],
    ["placeholder", "{agent_scratchpad}"],
  ]);
 
  // 5. Bind tools to the agent
  const agent = createToolCallingAgent({
    llm,
    tools,
    prompt: promptTemplate,
  });
 
  const agentExecutor = new AgentExecutor({
    agent,
    tools,
    maxIterations: 10,
    // Framework-level error handling for 429s passed through by Truto
    handleParsingErrors: true,
  });
 
  try {
    const result = await agentExecutor.invoke({ input: prompt });
    console.log(result.output);
  } catch (error) {
    if (error.status === 429) {
      const resetTime = error.headers['ratelimit-reset'];
      console.error(`Rate limit exceeded. Caller must backoff until: ${resetTime}`);
      // Implement your application-level backoff queue here
    } else {
      console.error("Workflow failed:", error);
    }
  }
}

This architecture guarantees that the LLM is always working with the most up-to-date API schemas. When Mem0 adds new filtering capabilities or updates their V3 pipeline, Truto updates the definitions on the backend. The next time your agent calls the /tools endpoint, it instantly inherits the new capabilities without requiring code redeployments on your end.

Workflows in Action

By chaining these tools together, your agents can execute complex, multi-stage state management workflows autonomously. Here are two real-world examples of how AI agents interact with Mem0 in production environments.

1. Automated DSAR (Right-to-be-Forgotten) Execution

When a user requests account deletion, compliance teams often scramble to manually scrub vector databases and unstructured memory logs to avoid regulatory fines. An AI agent can handle this entire lifecycle instantly.

"User 'david.miller@example.com' (user_id: 8472) has submitted a GDPR deletion request. Please find all projects they belong to, remove them as a project member, and then permanently delete all their historical memories from the datastore."

Step-by-step Execution:

The agent calls list_all_mem_0_projects to retrieve the organization's active projects.
It loops through the projects, calling list_all_mem_0_project_members to check if david.miller@example.com is attached.
Upon finding a match, it calls delete_a_mem_0_project_member_by_id to revoke access.
Finally, it executes mem_0_memories_bulk_delete_by_filter, passing {"filters": {"user_id": "8472"}} to ensure zero data retention across the memory layer.

Outcome: The agent confirms the user has been fully excised from both project access lists and the underlying LLM memory store, leaving behind a clean audit trail.

2. Contextualizing and Appending Support History

Customer success agents often lack context when entering a ticket. An AI agent can run in the background, summarizing previous pain points and updating the user's persistent profile.

"A new high-priority ticket just came in for org_id 'acme_corp'. Search Mem0 for their recent complaints regarding 'latency' or 'downtime'. If they have a history of these issues, update their main account memory to tag them as 'churn_risk' and summarize the findings."

Step-by-step Execution:

The agent calls mem_0_memories_search, passing the query "latency OR downtime" and nesting {"org_id": "acme_corp"} in the filters block.
It reads the returned hybrid search results, identifying three severe complaints over the past month.
It calls create_a_mem_0_memory to append a new synthesized insight detailing the recurring latency issues.
It queries the system again to find the core account metadata memory, retrieves its id, and uses update_a_mem_0_memory_by_id to inject the "tags": ["churn_risk"] array.

Outcome: The agent proactively enriches the customer's profile, ensuring that the next time an LLM interacts with this user, it treats them with high sensitivity based on the newly injected churn_risk tag.

Moving Beyond Manual Integration Wrappers

Connecting Mem0 to AI agents shouldn't require your engineering team to spend weeks managing OpenAPI specs, debugging nested JSON filters, or tracking down undocumented API behaviors.

By leveraging Truto's /tools endpoint, you delegate the heavy lifting of API normalization and schema generation to the infrastructure layer. Your developers can focus entirely on prompt engineering, agent orchestration, and building robust backoff queues for rate limits, rather than writing boilerplate integration code.