Apache Texera (Incubating)

Apache Texera (Incubating) is an open-source platform for human-AI collaborative data science using visual workflows. It enables analysts to construct, execute, and refine data analysis tasks through an intuitive GUI, assisted by AI agents that understand natural-language instructions. Texera is well-suited for a wide range of applications, including “AI for Science,” by making advanced AI and data science capabilities accessible to a broader community. It can run on a laptop for local use or be deployed in the cloud to support scalable processing of large datasets.

Operator 01

Human-AI Collaborative Data Science

Pending
ai-agents chatbots natural-language

Conduct data science through AI agents using natural language. Texera lets you build, refine, and execute analytical workflows conversationally — no coding required.

Operator 02

Agent-Powered Visual Workflows

Pending
drag-drop visual low-code

Build end-to-end data science workflows through an intuitive graphical interface. Drag-and-drop operators, connect them visually, and inspect results as you go.

Operator 03

Real-Time Collaboration

Pending
collaborative-editing shared-execution multi-user

Collaborate on workflows like Google Docs, but for data science. Multiple users can co-edit workflows and share executions in real time within Texera.

Operator 04

Scalable Parallel Engine

Pending
parallel-engine big-data scalable

Process large volumes of data with Texera's parallel backend engine. Designed for high-performance, scalable execution across big-data workloads.

Operator 05

Language-Agnostic Runtime

Pending
python java user-defined-functions

Texera's workflow runtime is language-agnostic, with native support for Python and Java. Extend pipelines with user-defined functions in your language of choice.

Operator 06

Cloud-Native Deployment

Pending
compute-storage-separation cloud flexible

Separation of compute and storage enables flexible cloud deployment. Scale Texera from a laptop to the cloud as your data ecosystem grows.

Operator 07

Runtime Debugging

Pending
interactive-execution debugging breakpoints

Debug workflows interactively at runtime. Inspect intermediate results, set breakpoints, and iterate on your pipeline without restarting from scratch.

Operator 01

Human-AI Collaborative Data Science

Pending
ai-agents chatbots natural-language

Conduct data science through AI agents using natural language. Texera lets you build, refine, and execute analytical workflows conversationally — no coding required.

Operator 02

Agent-Powered Visual Workflows

Pending
drag-drop visual low-code

Build end-to-end data science workflows through an intuitive graphical interface. Drag-and-drop operators, connect them visually, and inspect results as you go.

Operator 03

Real-Time Collaboration

Pending
collaborative-editing shared-execution multi-user

Collaborate on workflows like Google Docs, but for data science. Multiple users can co-edit workflows and share executions in real time within Texera.

Operator 04

Scalable Parallel Engine

Pending
parallel-engine big-data scalable

Process large volumes of data with Texera's parallel backend engine. Designed for high-performance, scalable execution across big-data workloads.

Operator 05

Language-Agnostic Runtime

Pending
python java user-defined-functions

Texera's workflow runtime is language-agnostic, with native support for Python and Java. Extend pipelines with user-defined functions in your language of choice.

Operator 06

Cloud-Native Deployment

Pending
compute-storage-separation cloud flexible

Separation of compute and storage enables flexible cloud deployment. Scale Texera from a laptop to the cloud as your data ecosystem grows.

Operator 07

Runtime Debugging

Pending
interactive-execution debugging breakpoints

Debug workflows interactively at runtime. Inspect intermediate results, set breakpoints, and iterate on your pipeline without restarting from scratch.

Operator 01

Human-AI Collaborative Data Science

Pending
ai-agents chatbots natural-language

Conduct data science through AI agents using natural language. Texera lets you build, refine, and execute analytical workflows conversationally — no coding required.

Operator 02

Agent-Powered Visual Workflows

Pending
drag-drop visual low-code

Build end-to-end data science workflows through an intuitive graphical interface. Drag-and-drop operators, connect them visually, and inspect results as you go.

Operator 03

Real-Time Collaboration

Pending
collaborative-editing shared-execution multi-user

Collaborate on workflows like Google Docs, but for data science. Multiple users can co-edit workflows and share executions in real time within Texera.

Operator 04

Scalable Parallel Engine

Pending
parallel-engine big-data scalable

Process large volumes of data with Texera's parallel backend engine. Designed for high-performance, scalable execution across big-data workloads.

Operator 05

Language-Agnostic Runtime

Pending
python java user-defined-functions

Texera's workflow runtime is language-agnostic, with native support for Python and Java. Extend pipelines with user-defined functions in your language of choice.

Operator 06

Cloud-Native Deployment

Pending
compute-storage-separation cloud flexible

Separation of compute and storage enables flexible cloud deployment. Scale Texera from a laptop to the cloud as your data ecosystem grows.

Operator 07

Runtime Debugging

Pending
interactive-execution debugging breakpoints

Debug workflows interactively at runtime. Inspect intermediate results, set breakpoints, and iterate on your pipeline without restarting from scratch.

Texera screenshot
Apache Incubator
License Apache 2.0
Users 332 Projects 86 Workflows 2,481 Executions 51K Workflow Versions 357K Deployments 7 Largest Deployment 100 nodes, 400 cores
apache/texera

Collaborative Machine-Learning-Centric Data Analytics Using Workflows

Scala 232 122