Description
- Dataiku features basics to advanced
Dataiku is an end‑to‑end AI and data platform that supports visual data prep, code notebooks, AutoML, and enterprise governance.
- Visual data preparation — drag‑and‑drop recipes for cleaning, joining, and shaping datasets.
- Code notebooks — run Python, R, and SQL interactively alongside visual flows.
- AutoML — automated model selection, tuning, and explainability for rapid prototyping.
- Recipe library — prebuilt transformations and connectors to speed pipeline development.
- Data engineering — scalable pipelines with Spark, Dask, and integrations for Databricks and Snowflake.
- Feature engineering — built‑in tools and visual recipes to create and manage ML features.
- Experiment tracking — track runs, metrics, and model versions for reproducibility.
- MLflow integration — native support for experiment tracking, model packaging, and deployment.
- Deployment and scoring — one‑click model deployment to REST endpoints or batch scoring jobs.
- Automation and scenarios — schedule, monitor, and alert on production pipelines and jobs.
- Governance and collaboration — project workspaces, role‑based access, and audit trails for enterprise use.
- Data quality and testing — built‑in checks, assertions, and test recipes to validate datasets.
- Catalog and lineage — metadata, lineage tracking, and searchable datasets for discoverability.
- Scalability — run on local, on‑prem, or cloud infrastructures with elastic compute options.
- Extensibility — plugins, custom code, and API hooks to integrate with existing toolchains.
- Advanced analytics — support for time series, deep learning, and production feature stores.




