Making Complex Data Flows Understandable at Scale

This project rebuilt Select Star's core data lineage view to handle 10,000+ node graphs without breaking. The redesign cut load time by 96% (12.9s → 0.4s), eliminated browser crashes, and lifted DAU by 28% post-launch.

96% Load time reduction
12.9s → 0.4s

+28% DAU increase
post-launch

0 Browser crashes on
10,000+ node graphs

Context & Challenge

The core differentiator was becoming a liability

Select Star's data lineage view was the platform's core differentiator — until it wasn't. As enterprise customers scaled from hundreds to tens of thousands of data assets, the original "show everything" canvas turned unreadable and crashed browsers on graphs above 5,000 nodes.

The challenge: preserve the analytical depth that data engineers depended on, while making it usable at enterprise scale.

Approach

Starting with an assumption — then proving it wrong

I led the redesign end-to-end with a small product, engineering, and data team. We started with the assumption that better rendering and visual hierarchy would carry us through — essentially, scale the existing model.

When we pushed the prototype to a staging environment running real customer data, it crashed. Performance issues that engineering had flagged in V1 had compounded with the new visual layer. Rather than optimize harder, I proposed a more fundamental pivot — and validated it with users before committing to the rebuild.

Discovery: Validating the pivot before building

I ran prototype-testing sessions with data engineers across our pillar accounts to answer three questions:

1 Is the new UI meaningfully different enough to justify the rebuild?

2 Does the prototype match how users actually trace dependencies?

3 What do customers prioritize — performance or end-to-end visibility?

The findings were unambiguous. Users completed tasks faster and immediately noticed the difference. They kept asking to focus on one node without losing context. And every customer chose performance over completeness — every time.

The pivot

Show Everything → Context on Demand

Instead of rendering the full graph upfront and asking the browser to handle it, V2 starts with a single node and lets users expand outward — pulling context only when they need it. Fewer nodes on screen at any moment, but each one rendered with the depth and density data engineers actually use.

Before

After

Design Highlights

Key design decisions

          01
          Edge color system and interaction logic
        

A consistent visual language for upstream, downstream, and column-level dependencies that scales across thousands of nodes without becoming noise.

          02
          Logic-driven component system
        

I designed the new lineage components to be reusable across product surfaces. After shipping in Lineage, the same system extended to ERD and Semantic Layer, cutting component build time by 40%.

          03
          Visual hierarchy for cross-stack workflows
        

Enterprise users move between Snowflake, BigQuery, dbt, Looker and more. Clear visual differentiation between sources reduced cognitive load when reading complex flows.

Outcomes & Impact

Shipped as the new default for all enterprise customers

Load time reduced from 12.9s to 0.4s — 96% faster

Zero browser crashes on 10,000+ node graphs

+28% DAU post-launch

Engineering UI implementation time reduced by 50%+ via the shared component system

Scaled to support 55+ enterprise customers, including multiple Fortune 500s

The redesign became one of Select Star's strongest differentiators in late-stage enterprise sales — and a contributing factor to the company's acquisition by Snowflake in December 2025.

Back to work