Resources

Everything you need to get started with Datally

Demo Datasets

Download sample datasets to test Datally's powerful data consolidation capabilities. These realistic datasets demonstrate common data integration challenges and how Datally solves them. Each dataset includes multiple source files, a reference dictionary, and intentional data quality issues to showcase validation features.

E-Commerce Orders Dataset

Multi-channel order data from Shopify, Amazon, and WooCommerce

What's Included:

  • 35 orders across 3 sales channels
  • Shopify, Amazon, and WooCommerce exports
  • Reference dictionary schema
  • 15+ data quality issues
  • Different column naming patterns
  • Value translation examples

Perfect For Learning:

  • AI-powered column mapping across different schemas
  • Value translation (order statuses, payment methods)
  • Data validation (email formats, date plausibility)
  • Multi-source data consolidation
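
To make value translation and validation concrete, here is a minimal Python sketch of the kind of checks this dataset is meant to exercise. It is not Datally's implementation: the status values in STATUS_MAP, the email pattern, and the date bounds are all hypothetical choices made for illustration.

```python
import re
from datetime import date

# Hypothetical translation table: channel-specific status -> canonical status.
STATUS_MAP = {
    "fulfilled": "shipped",
    "shipped": "shipped",
    "wc-completed": "completed",
    "pending": "pending",
}

# Loose pattern: something@something.something, with no spaces.
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def translate_status(raw):
    """Return the canonical status, or None when the value is unknown."""
    return STATUS_MAP.get(raw.strip().lower())


def is_valid_email(value):
    """Syntactic check only; it does not verify that the mailbox exists."""
    return bool(EMAIL_RE.match(value.strip()))


def is_plausible_order_date(value, earliest=date(2000, 1, 1), today=None):
    """Accept ISO dates (YYYY-MM-DD) that fall between `earliest` and today."""
    today = today or date.today()
    try:
        parsed = date.fromisoformat(value)
    except ValueError:
        return False  # not a real calendar date, e.g. 2024-02-30
    return earliest <= parsed <= today
```

Unknown statuses return None instead of raising, so a consolidation run could collect them for review rather than stop at the first unmapped value.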

HR Payroll Dataset

Multi-source employee and payroll data from different HR systems

What's Included:

  • ADP Workforce Data: employee records from the ADP system
  • BambooHR Employees: HR data from the BambooHR platform
  • Paychex Roster: payroll data from Paychex
  • Workday Export Dictionary: reference schema from Workday

Perfect for Testing:

  • Multi-source data consolidation
  • Column mapping across different schemas
  • Data validation and quality checks
  • Handling inconsistent formats and naming conventions
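
As a rough illustration of mapping inconsistently named columns onto a reference schema, the sketch below normalizes names and picks the closest reference column by string similarity. It uses Python's standard difflib as a simple stand-in, not Datally's AI-based mapping, and the column names in the usage example are hypothetical.

```python
import difflib
import re


def normalize(name):
    """Lowercase a column name and collapse separators: 'Emp_ID ' -> 'emp id'."""
    spaced = re.sub(r"([a-z0-9])([A-Z])", r"\1 \2", name)  # split camelCase
    return re.sub(r"[^a-z0-9]+", " ", spaced.lower()).strip()


def map_columns(source_columns, reference_columns, cutoff=0.6):
    """Map each source column to the closest reference column, or None."""
    normalized_refs = {normalize(ref): ref for ref in reference_columns}
    mapping = {}
    for column in source_columns:
        matches = difflib.get_close_matches(
            normalize(column), list(normalized_refs), n=1, cutoff=cutoff
        )
        mapping[column] = normalized_refs[matches[0]] if matches else None
    return mapping


if __name__ == "__main__":
    # Hypothetical column names, for illustration only.
    print(map_columns(["EmployeeID", "hireDate"], ["employee_id", "hire_date"]))
```

String similarity only catches naming variations. Columns that mean the same thing under unrelated names (for example "surname" and "last_name") need semantic matching, which is where embedding models come in.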

Local AI Setup with Ollama

Datally runs AI models locally on your machine for complete privacy and data security. All processing happens on your own hardware, so your data never leaves it.

✅ Works on All Hardware

With GPU (4GB+ VRAM): Full AI features including LLM models
CPU-Only: Embedding models for intelligent mapping

Step 1: Install Ollama (Windows)

Ollama is free and open source. Run powerful AI models locally with complete privacy.

⚡ Recommended: Windows Package Manager

winget install --id=Ollama.Ollama -e

Alternative: Manual Download

Download Ollama for Windows

After installation, Ollama runs automatically as a background service on port 11434.
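
If you want to confirm from a script that the service is up, a small check like the following may help. It is a sketch that assumes the default address http://localhost:11434 and treats an HTTP 200 response from the root URL as "running".

```python
import urllib.error
import urllib.request

OLLAMA_URL = "http://localhost:11434"  # assumed default address and port


def ollama_is_running(base_url=OLLAMA_URL, timeout=2.0):
    """Return True if an HTTP server answers with status 200 at base_url."""
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or an HTTP error status.
        return False


if __name__ == "__main__":
    print("Ollama reachable:", ollama_is_running())
```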

Step 2: Download Recommended Models

💻 For Systems with GPU (8GB+ VRAM)

Recommended Language Models:
gpt-oss:20b ⭐ Best Overall

Open-weight GPT model from OpenAI. The best balance of quality, speed, and accuracy for data validation and mapping tasks.

ollama pull gpt-oss:20b

IBM Granite 3.3 - Fastest option with excellent accuracy

ollama pull granite3.3:8b

phi4:14b High Performance

Microsoft Phi-4 - Strong alternative option (requires 12GB+ VRAM)

ollama pull phi4:14b

Recommended Embedding Models:

Alibaba Qwen3 8B - Ranked #1 on the MTEB multilingual leaderboard at its release, for semantic similarity

ollama pull qwen3-embedding:8b

Alibaba Qwen3 4B - Faster option with excellent quality

ollama pull qwen3-embedding:4b

💻 For CPU-Only / Integrated GPU Systems

Even without a GPU, you can run embedding models locally for AI-powered column mapping.

IBM Granite - Optimized for CPU inference

ollama pull granite-embedding:30m

IBM Granite 278M - Better quality with 12+ language support

ollama pull granite-embedding:278m
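
To show how an embedding model can support column mapping, here is a minimal sketch that requests embeddings from a local Ollama server and compares them with cosine similarity. It assumes Ollama's /api/embed endpoint at the default address; the function names and the idea of embedding column names directly are illustrative assumptions, not a description of how Datally works internally.

```python
import json
import math
import urllib.request


def embed(texts, model="granite-embedding:30m", base_url="http://localhost:11434"):
    """Request one embedding per text from Ollama's /api/embed endpoint."""
    payload = json.dumps({"model": model, "input": texts}).encode("utf-8")
    request = urllib.request.Request(
        f"{base_url}/api/embed",
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)["embeddings"]


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def best_match(source_vector, reference_vectors):
    """Return (name, score) of the reference column most similar to the source."""
    scored = (
        (name, cosine_similarity(source_vector, vector))
        for name, vector in reference_vectors.items()
    )
    return max(scored, key=lambda pair: pair[1])
```

With the server running and a model pulled, you could embed a source column name and each reference column name with embed(), then pass the vectors to best_match() to pick the closest reference column.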

Step 3: Verify Installation

Check installed models: ollama list
Test a model: ollama run gpt-oss:20b "Hello"
Launch Datally and configure AI models in Settings
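
The same check as ollama list can be scripted against the local server. The sketch below assumes Ollama's /api/tags endpoint at the default address and compares the installed model names with a list you supply; the helper names are hypothetical.

```python
import json
import urllib.request


def parse_model_names(tags_response):
    """Extract model names from the JSON object returned by /api/tags."""
    return {entry["name"] for entry in tags_response.get("models", [])}


def installed_models(base_url="http://localhost:11434"):
    """Ask the local Ollama server which models are installed."""
    with urllib.request.urlopen(f"{base_url}/api/tags", timeout=5) as response:
        return parse_model_names(json.load(response))


def missing_models(required, installed):
    """Return the required model names that are not installed, sorted."""
    return sorted(set(required) - set(installed))


if __name__ == "__main__":
    wanted = ["gpt-oss:20b", "qwen3-embedding:8b"]
    print("Still to pull:", missing_models(wanted, installed_models()))
```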

Documentation & Support

Getting Started Guide

Learn how to use Datally with step-by-step tutorials and best practices.

Coming soon

Technical Support

Need help? Our support team is here to assist you.

Ready to Get Started?

Join our Design Partner Program and shape the future of data consolidation