Ingest anything
Text, CSV, JSON, Markdown, code, images, audio, video — eight parsers and growing. Drop a folder; it understands the contents.
Pre-release · v0.1 in development
Drop any file. Find anything. Run AI locally.
No account. No cloud. No telemetry. After your chosen models are
installed, no internet is required at runtime.
Try-in-browser and desktop installers ship at v0.1 launch.
app.mydatasights.app
Inputs
📄 air-quality.csv
📊 electrical-3phase.csv
📚 books-library.csv
🎲 boardgames.csv
Insights
⚡ PM2.5 breach detected — bin 14:00–15:00
📈 RADON ~ NO2 correlate (r=0.71)
🔍 Profile: air-quality (confidence 0.94)
🧮 Drift on column “temp”: stable → drifted
Models
⓿ Phi-3-mini-Q4
⓿ MiniLM-L6
⓿ Whisper-tiny
⓿ all on-device
Mockup of the v0.1 workspace — interactive preview at launch.
Text, CSV, JSON, Markdown, code, images, audio, video — eight parsers and growing. Drop a folder; it understands the contents.
Full-text plus embeddings plus tags plus facets — one search box, four index layers, ranked together.
Time-series, heatmaps, latent projections, mind-maps, timelines. The right view for the data, picked automatically.
Sentiment, NER, summarisation, OCR, chat. Models live on your device. Nothing leaves it.
Plugins are folders with a manifest. Drop one in — parser, viewer, analyser, view. No rebuild required.
No server, no analytics, no account. Even crash reports stay in your browser until you choose to export them.
v0.1 is in active development — track progress in the project status doc on GitHub. The fastest way to be notified at launch is to Watch → Releases only on the repository. No mailing list, no signup form: we hold no email addresses we don't have to.
Only AI model files, and only when you choose to download one. After that, everything — your files, your searches, your analyses — stays on-device. See privacy for the exact list.
No. There is no telemetry, no cookies, no tracking pixels. The only server-side data is your IP in Cloudflare's access logs while you visit this page, retained briefly for abuse prevention.
The browser version runs the same app, but is limited by browser storage quotas and cannot reach files outside what you drag in. The desktop version (Tauri) can read your filesystem, has larger storage, and supports bigger AI models like the 2 GB Phi-3 chat model.
Yes. The open-source core is Apache-2.0 — no account required, no SaaS upsell. An optional paid Pro edition with extra plugins (encrypted workspace, advanced analyzers) is planned for v1.0; the free core will always remain fully functional on its own. Sponsoring on GitHub is welcome but optional.
Yes. The Apache-2.0 license permits commercial use; see the license. Outputs are advisory and you should verify before relying on them for any business decision.
The app surfaces patterns and statistical estimates, not authoritative truth. Models can be wrong — sometimes confidently — on small samples, ambiguous text, or data outside their training domain. Do not rely on outputs for medical, legal, financial, or safety-critical decisions.
Email [email protected]. Please don't open public issues for vulnerabilities.
No email signup. Just a GitHub watch for release notifications.