Everything you need to get started with Datally
Download sample datasets to test Datally's powerful data consolidation capabilities. These realistic datasets demonstrate common data integration challenges and how Datally solves them. Each dataset includes multiple source files, a reference dictionary, and intentional data quality issues to showcase validation features.
Multi-channel order data from Shopify, Amazon, and WooCommerce
Multi-source employee and payroll data from different HR systems
ADP Workforce Data
Employee records from ADP system
BambooHR Employees
HR data from BambooHR platform
Paychex Roster
Payroll data from Paychex
Workday Export Dictionary
Reference schema from Workday
Datally runs AI models locally on your machine for complete privacy and data security. All processing happens on your hardware - your data never leaves your machine.
Ollama is free and open source. Run powerful AI models locally with complete privacy.
After installation, Ollama runs automatically as a background service on port 11434
Open-source GPT model from OpenAI - The best balance of quality, speed, and accuracy for data validation and mapping tasks.
ollama pull gpt-oss:20bMicrosoft Phi-4 - Strong alternative option (requires 12GB+ VRAM)
ollama pull phi4:14bAlibaba Qwen3 - #1 on MTEB leaderboard for semantic similarity
ollama pull qwen3-embedding:8bAlibaba Qwen3 4B - Faster option with excellent quality
ollama pull qwen3-embedding:4bEven without a GPU, you can run embedding models locally for AI-powered column mapping.
IBM Granite - Optimized for CPU inference
ollama pull granite-embedding:30mIBM Granite 278M - Better quality with 12+ language support
ollama pull granite-embedding:278mollama listollama run gpt-oss:20b "Hello"Join our Design Partner Program and shape the future of data consolidation