Risk management frameworks help us identify and address AI risks early, saving time and cost while ensuring our clients’ AI solutions are built on a foundation of trust and reliability.
Our research into chatbot performance shows how input filters built with traditional ML increasingly need tailored training data to identify security risks.
This case study shows how unsupervised learning can identify compound adversarial attacks even when training data is limited.
System prompt exfiltration is among the most alarming LLM attacks. We propose a definition that makes prompt exfiltration attacks easier to identify.
Input filtering is a blue-teaming operation and essential to building safe, secure LLMs.
Learn how to adopt a mindset of continuous evaluation in generative AI, exploring popular benchmarks and AI red teaming methods.
Learn how to automate the evaluation and categorization of LLM attack methods so your AI red team ensures good test coverage and finds vulnerabilities.
As GenAI implementations become more prominent, it's critical to adhere to responsible AI practices to protect your brand and foster customer trust.
Learn how these five artificial intelligence design techniques build trust in highly regulated industries like healthcare.
A CIA technique called a canary trap helps us detect AI hallucination risk in large language models (LLMs) enhanced with retrieval-augmented generation (RAG).
Intent classification, used in concert with a large language model (LLM) and retrieval-augmented generation (RAG) system, resulted in a safer financial chatbot.
Boost AI reliability by preventing AI hallucinations with WillowTree's three-pronged approach to minimizing and mitigating incorrect information produced by LLMs.