In the Distill ecosystem, Artifacts serve as the foundational “knowledge base” or “brain” for your AI operations. While Datasets handle structured numerical information (the “calculator”), Artifacts are designed to manage unstructured data—the vast repositories of text-based information that define your business processes, compliance standards, and technical expertise.

The Role of Artifacts in Retrieval-Augmented Generation (RAG)

Distill utilizes a sophisticated Retrieval-Augmented Generation (RAG) architecture. Instead of relying on general knowledge, the AI refers specifically to your uploaded Artifacts to formulate answers. This ensures that every response is factually grounded in your specific business context, significantly reducing the risk of “hallucinations” or generic advice.

The “Ask” Widget Integration

The primary way to interact with your Artifacts is through the “Ask” widget on your Insight Dashboards. Key features include:

System Instructions: You can “prime” the AI by giving it a persona (e.g., “Answer as a senior safety inspector”) to tailor the tone and depth of its responses.
Scoped Searching: You can limit the AI’s “search area” to specific folders within your Artifacts library, ensuring it only looks at the most relevant manuals or SOPs for a given dashboard.

Management and Ingestion Workflows

Managing your Artifacts library is designed for both speed and organizational flexibility.

Uploading New Artifacts

Direct Upload: Within the Insight Distillery > Artifacts tab, you can simply drag and drop folders of documents (PDFs, .txt, docx) directly into the interface.
From Files & Docs: You can also migrate existing documentation stored in the Appenate platform by selecting the desired documents in the Files & Docs area and choosing “Send selected Document(s) to Distill.”

Technical Data Processing

Once an Artifact is uploaded, the Distill backend performs several automated steps to prepare it for AI interaction:

Cleaning & Pre-processing: Extraneous formatting is removed to extract pure text content.
Intelligent Chunking: Large documents are segmented into smaller, manageable “chunks.” These chunks include overlaps to preserve semantic meaning across artificial boundaries.
Embedding Generation: Each text chunk is converted into a vector embedding—a high-dimensional numerical representation of its semantic meaning—using Google’s Gemini models.
Similarity Search: When you ask a question, the system finds the specific chunks in your library that mathematically most closely match your query.

Summary of Artifact Capabilities

Feature	Functional Description
Content Type	Unstructured data such as PDFs, SOPs, manuals, and project logs.
Core Utility	Provides the “Source of Truth” for the AI’s RAG-based responses.
Interaction Point	The “Ask” widget on Insight Dashboards.
Search Scope	Configurable at the widget level to target specific document folders.

Data Security and Privacy

Security is integrated into the Artifacts workflow by design:

Regional Isolation: Traffic remains within your nominated Appenate node (US, EU, or AU), utilizing regional Gemini Pro instances.
Stateless Processing: AI context is discarded immediately after each interaction is complete.
Account Isolation: Your Artifacts are strictly isolated to your organization; they are never used to train global AI models.

Understanding Distill Artifacts

Understanding Distill Artifacts

The Role of Artifacts in Retrieval-Augmented Generation (RAG)

The “Ask” Widget Integration

Management and Ingestion Workflows

Uploading New Artifacts

Technical Data Processing

Summary of Artifact Capabilities

Data Security and Privacy

Related Articles

Understanding Distill Datasets

Introduction to Distill

Introduction to Distill Dashboards

Navigating the Dashboard Designer

Choosing the Right Visualization: Charts & Widgets