Understanding Native Tools in Bud Agent Builder: Web Search

Jun 9, 2026 | By Bud

How it works

When Web Search is enabled on a prompt version, the agent’s model can decide to call the tool with a query string. The tool runs the search against a DuckDuckGo-backed index and returns a clean, structured list of results that the model can summarise, cite, or follow up on by fetching the full page with the companion Web Fetch tool.

Each result is a small, transparent object with three fields:

  • title — the page title as indexed
  • href — the destination URL (citation-ready)
  • body — a short snippet, typically one to three sentences

The configuration surface is deliberately tiny. There is a single optional field, max_results, which caps how many results come back; left unset, the tool returns the first batch. There are no API keys to manage and no per-project quotas to provision, because the DuckDuckGo backing means there is nothing to set up. Typical responses complete in one to three seconds.

One design choice worth calling out: returned snippets are treated as untrusted input by default. Downstream prompts are not meant to interpret them as instructions, which hardens agents against the classic “a poisoned search result hijacks the model” attack. That posture is built into the tool’s guidance rather than something you have to remember to add.

Bud

Bud

Related Blogs

Introducing SIMD-Bench: An Open-Source Framework for Cross-Architecture Benchmarking, Profiling, and Improving SIMD Kernels
Introducing SIMD-Bench: An Open-Source Framework for Cross-Architecture Benchmarking, Profiling, and Improving SIMD Kernels

We open-sourced SIMD-Bench, an open-source framework that benchmarks and profiles SIMD kernels to evaluate and compare their performance across different instruction set architectures (ISAs), CPU platforms and CPU vendors. Unlike existing profilers that focus on single platforms or shallow metrics, SIMD-Bench combines reproducible cross-architecture benchmarking, roofline and top-down microarchitectural analysis, correctness validation, and energy metrics […]

Why Use FCSP If GPUs Already Support MIG?
Why Use FCSP If GPUs Already Support MIG?

If you’ve ever tried to share a GPU between multiple users or workloads in a Kubernetes cluster, you’ve probably heard of NVIDIA’s Multi-Instance GPU (MIG) technology. It’s the official, hardware-backed solution for GPU partitioning. But what if I told you there’s a compelling software alternative that might actually be better for your use case? Enter FCSP (Fixed […]

How to Build vLLM Plugins: A comprehensive Developer Guide with tips and best practices
How to Build vLLM Plugins: A comprehensive Developer Guide with tips and best practices

Building plugins for vLLM allows you to tailor the system to your specific requirements and integrate custom functionality into your LLM workflows. Whether you’re looking to integrate custom functionality, optimize performance, or streamline deployment, understanding how vLLM’s plugin system works is essential. In this comprehensive developer guide, I’ll break down the core concepts, walk through […]

Fixed Capacity Spatial Partition, FCSP : GPU Resource Isolation Framework for Multi-Tenant ML Workloads
Fixed Capacity Spatial Partition, FCSP : GPU Resource Isolation Framework for Multi-Tenant ML Workloads

GPU sharing in multi-tenant cloud environments requires efficient resource isolation without sacrificing performance. We present FCSP (Fixed Capacity Spatial Partition), a user-space GPU virtualization framework that achieves sub-microsecond memory enforcement and deterministic compute throttling through lock-free data structures and hierarchical token bucket rate limiting. Unlike existing solutions that rely on semaphore-based synchronization, FCSP employs C11 […]