GenAI articles

Technical articles covering the state of the art in large language models (LLMs) and the emerging world of multimodal AI. From mastering Graph RAG to cutting-edge fine-tuning techniques, this section provides the blueprints for building modern generative systems.

Training AI agents with human feedback: A guide to ALHF and modern alignment

Your domain experts know things the model doesn't. ALHF captures their natural-language feedback to improve AI agents without a single retraining cycle.
16 mins read

AI agents in healthcare: Enhancing patient outcomes and streamlining operations

Explore how AI agents are revolutionizing healthcare by enhancing clinical decision-making, personalizing patient care, automating administrative tasks, and addressing key challenges.
10 mins read

Evaluating LLMs in production: From drift detection to continuous monitoring

Learn how to monitor LLMs in production with continuous evaluation, drift detection, trace visibility, and W&B Weave dashboards for reliable LLMOps.
18 mins read

Mastering AI agent observability: From black-box to traceable systems

On this page: What is AI agent observability? | The shift from "Is it up?" | Agent vs traditional observability | For multi-agent systems | The 5 pillars of…
9 mins read

Exploring multi-agent AI systems

This article explores multi-agent AI systems, examining how multiple specialized agents collaborate to enhance decision-making, problem-solving, and automation across various domains.
12 mins read

Evaluating autonomous AI agents for performance, oversight, and business value

A blueprint for evaluating AI agents across performance, oversight, and business impact so they don’t implode.
14 mins read

Exploring LLM-as-a-Judge

Learn how LLM-as-a-judge works, when to use it (and when not to), common bias and failure modes, and research-backed best practices for building reliable evaluation systems.
22 mins read

LLM observability: Your guide to monitoring AI in production

Deploying LLM applications into production is complex. This guide explains LLM observability: why it matters, common failure modes such as hallucinations, key tool features, and how to get started with W&B Weave.
3 mins read

Generative AI in banking and finance

Generative AI is revolutionizing the financial services industry by automating complex tasks, enhancing customer interactions, and bolstering security. In banking, generative AI models can generate…
16 mins read

AI agents in finance and banking

Explore the shift to agentic AI in finance. Learn how to build safe, autonomous workflows for banking—from fraud detection to compliance—ensuring auditability with W&B Weave.
18 mins read