
04. Developer Guide

FindHao edited this page Jul 30, 2025 · 6 revisions

This guide is for developers who want to contribute to TritonParse, understand its architecture, or extend its functionality.

πŸ—οΈ Architecture Overview

High-Level Architecture

TritonParse consists of three main components:

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚   Python Backend       β”‚    β”‚   Processing           β”‚    β”‚   Frontend UI          β”‚
β”‚                        β”‚    β”‚                        β”‚    β”‚                        β”‚
β”‚ β€’ Structured Logging   │──▢│ β€’ Log Parsing          │──▢│ β€’ React Interface      β”‚
β”‚ β€’ Triton Hooks         β”‚    β”‚ β€’ Source Mapping       β”‚    β”‚ β€’ IR Visualization     β”‚
β”‚ β€’ Trace Generation     β”‚    β”‚ β€’ Data Compression     β”‚    β”‚ β€’ Code Comparison      β”‚
β”‚                        β”‚    β”‚ β€’ Process Launch Trace β”‚    β”‚ β€’ Launch Diff Analysis β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Component Details

1. Python Backend (tritonparse/)

  • Purpose: Capture Triton compilation events and generate structured logs. This package contains the core logic for parsing IR, processing traces, and creating source mappings.
  • Key Files:
    • structured_logging.py - Main logging infrastructure to capture events.
    • trace_processor.py - Processes raw trace files, groups events, and generates launch_diff and autotune analysis.
    • ir_parser.py - Extracts source location information from various IRs (TTIR, TTGIR, PTX, AMDGCN).
    • mapper.py - Creates bidirectional mappings between different IRs and Python source code.
    • utils.py - Main CLI entrypoint (unified_parse) and other parsing utilities.
    • extract_source_mappings.py - Legacy utility for IR stage correlation.
    • event_diff.py - Logic for comparing kernel launch events.
    • sourcemap_utils.py - Helper functions for source mapping.
    • common.py, shared_vars.py, tp_logger.py - Common utilities, shared state, and logger configuration.

2. Processing Pipeline

  • Purpose: Transform raw logs into structured, analyzable format.
  • Key Functions:
    • Parse NDJSON logs
    • Extract source mappings between IR stages
    • Process launch trace
    • Compress and package data

3. Frontend UI (website/)

  • Purpose: Interactive visualization and analysis interface
  • Key Technologies:
    • React 19 with TypeScript
    • Vite build system
    • Tailwind CSS for styling
    • Monaco Editor for code display

πŸ“ Project Structure

tritonparse/
β”œβ”€β”€ tritonparse/                 # Python package
β”‚   β”œβ”€β”€ __init__.py
β”‚   β”œβ”€β”€ structured_logging.py    # Core logging infrastructure to capture events
β”‚   β”œβ”€β”€ trace_processor.py       # Processes raw trace files, groups events, and generates diffs
β”‚   β”œβ”€β”€ ir_parser.py             # Extracts source location information from various IRs
β”‚   β”œβ”€β”€ mapper.py                # Creates bidirectional mappings between different IRs and Python source code
β”‚   β”œβ”€β”€ event_diff.py            # Logic for comparing kernel launch events
β”‚   β”œβ”€β”€ utils.py                 # Main CLI entrypoint (`unified_parse`) and other parsing utilities
β”‚   β”œβ”€β”€ source_type.py           # Source type definitions (e.g., TTIR, PTX)
β”‚   β”œβ”€β”€ sourcemap_utils.py       # Helper functions for source mapping
β”‚   β”œβ”€β”€ common.py                # Common utilities and helper functions
β”‚   β”œβ”€β”€ shared_vars.py           # Shared state and variables for the package
β”‚   └── tp_logger.py             # Logger configuration
β”œβ”€β”€ website/                     # React web application for visualization
β”‚   β”œβ”€β”€ src/
β”‚   β”‚   β”œβ”€β”€ components/          # Reusable React components
β”‚   β”‚   β”‚   β”œβ”€β”€ ArgumentViewer.tsx   # Displays kernel arguments
β”‚   β”‚   β”‚   β”œβ”€β”€ CodeViewer.tsx       # Displays IR code with syntax highlighting
β”‚   β”‚   β”‚   β”œβ”€β”€ DiffViewer.tsx       # Side-by-side diff view for text
β”‚   β”‚   β”‚   β”œβ”€β”€ CodeComparisonView.tsx # Compares two IRs with line mappings
β”‚   β”‚   β”‚   └── AutotuneAnalysis.tsx # Displays autotuning results
β”‚   β”‚   β”œβ”€β”€ pages/               # Main application pages
β”‚   β”‚   β”‚   β”œβ”€β”€ KernelOverview.tsx # Main analysis view for a kernel, combining all components
β”‚   β”‚   β”‚   └── CodeView.tsx     # A focused view for a single IR file
β”‚   β”‚   β”œβ”€β”€ utils/               # Utility functions
β”‚   β”‚   β”‚   └── dataLoader.ts    # Data loading and processing from parsed logs
β”‚   β”‚   β”œβ”€β”€ App.tsx              # Main application component routing to pages
β”‚   β”‚   └── main.tsx             # Application entry point
β”‚   β”œβ”€β”€ public/                  # Static assets (e.g., images, sample data)
β”‚   β”œβ”€β”€ package.json             # Frontend dependencies and scripts
β”‚   └── vite.config.ts           # Vite build configuration
β”œβ”€β”€ tests/                       # Test suite for the Python package
β”œβ”€β”€ docs/                        # Project documentation
β”œβ”€β”€ .github/                     # GitHub Actions workflows
β”œβ”€β”€ .ci/                         # CI scripts
β”œβ”€β”€ pyproject.toml               # Python project configuration
β”œβ”€β”€ Makefile                     # Development commands
└── README.md                    # Project overview

πŸ”§ Development Environment Setup

Prerequisites

  • Python >= 3.10
  • Node.js >= 18.0.0
  • Triton >= 3.4.0 (latest version recommended)
  • Git for version control

1. Clone and Setup

# Clone repository
git clone https://github.com/pytorch-labs/tritonparse.git
cd tritonparse

# Install Python dependencies
make install-dev

# Install website dependencies
cd website
npm install

2. Verify Setup

# Check Python setup
make format-check
make lint-check
python -m unittest tests.test_tritonparse.TestTritonparseCPU -v

# Check website setup
cd website
npm run dev

πŸ› οΈ Development Workflow

Code Style and Formatting

We use a comprehensive formatting pipeline:

| Tool  | Purpose         | Configuration  |
|-------|-----------------|----------------|
| Black | Code formatting | pyproject.toml |
| usort | Import sorting  | pyproject.toml |
| Ruff  | Linting         | Built-in rules |

Essential Commands

# Format code
make format

# Check formatting
make format-check

# Run linting
make lint-check

# Run tests
python -m unittest tests.test_tritonparse -v

# Website development
cd website && npm run dev

Development Quality Checks

Before committing, ensure:

  1. Code is formatted: make format
  2. Linting passes: make lint-check
  3. Tests pass: python -m unittest tests.test_tritonparse -v
  4. Website builds: cd website && npm run build

πŸ—οΈ Backend Development

Core Components

1. Structured Logging (structured_logging.py)

Purpose: Capture Triton compilation and launch events in a structured format

Key Functions:

  • init(log_path, enable_launch_trace=False) - Initialize logging system.
    • log_path: The directory where log files will be stored.
    • enable_launch_trace: If True, captures detailed metadata for each kernel launch. This is required for launch analysis.

Integration Points:

  • Triton compilation hooks
  • PyTorch TorchInductor integration
  • Stack trace extraction

2. Log Processing (utils.py)

Purpose: Transform raw logs into an analyzable format

Key Functions:

  • unified_parse() - Main parsing interface
  • oss_run() - OSS-specific parsing logic
  • parse_logs() - Core log processing

Processing Pipeline:

  1. Read raw NDJSON logs from input directory
  2. Parse and validate log entries
  3. Extract source mappings between IR stages
  4. Compress and save processed data
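Step 4 can be illustrated with a gzip round-trip; the actual file naming and on-disk layout are determined by unified_parse, so treat this only as a sketch of the compression step:

```python
import gzip
import json

# Compress a processed payload as step 4 does, then verify it round-trips.
processed = {"kernels": [{"name": "add_kernel", "hash": "abc123"}]}
blob = gzip.compress(json.dumps(processed).encode("utf-8"))
restored = json.loads(gzip.decompress(blob))
print(restored["kernels"][0]["name"])  # add_kernel
```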

3. Source Mapping (extract_source_mappings.py)

Purpose: Correlate lines between different IR stages

Key Functions:

  • extract_source_mappings() - Main extraction logic
  • process_kernel_logs() - Process individual kernel logs
  • map_ir_stages() - Map lines between IR formats

Adding New Features

  1. Define the new data: Determine what new information needs to be captured.
  2. Update structured_logging.py: Add logic to capture the new data within the appropriate hooks (e.g., pre-compilation, post-compilation).
  3. Modify trace_processor.py: If the new data requires special processing or aggregation (like the launch analysis), add the logic here.
  4. Update unified_parse(): Ensure the new data is handled correctly during the main parsing routine.
  5. Write tests: Add unit and integration tests to tests/ to validate the new feature.

Testing Backend Changes

# Run CPU tests (no GPU required)
python -m unittest tests.test_tritonparse.TestTritonparseCPU -v

# Run GPU tests (requires CUDA)
python -m unittest tests.test_tritonparse.TestTritonparseCUDA -v

# Run specific test
python -m unittest tests.test_tritonparse.TestTritonparseCUDA.test_whole_workflow -v

# Test with real kernel
cd tests
TORCHINDUCTOR_FX_GRAPH_CACHE=0 python test_add.py

🎨 Frontend Development

Technology Stack

  • React 22 - UI framework
  • TypeScript - Type safety
  • Vite - Build tool and dev server
  • Tailwind CSS - Styling
  • Monaco Editor - Code display

Key Components

1. Data Loading (utils/dataLoader.ts)

Purpose: Load and process trace files

Key Functions:

  • loadLogData() - Load from URL
  • loadLogDataFromFile() - Load from file
  • processKernelData() - Process raw data

2. Code Viewer (components/CodeViewer.tsx)

Purpose: Display IR code with syntax highlighting

Features:

  • Language-specific syntax highlighting
  • Line number display
  • Interactive line selection
  • Source mapping visualization

3. Code Comparison (components/CodeComparisonView.tsx)

Purpose: Side-by-side IR comparison

Features:

  • Synchronized scrolling
  • Line mapping visualization
  • Interactive highlighting
  • Dropdown IR selection

Adding New Features

  1. Update dataLoader.ts: Modify the data loading and processing functions to handle any new data fields from the backend.
  2. Create new components: In website/src/components/, create new React components to display the new information, for example a new panel in KernelOverview.tsx or a new view.
  3. Integrate components: Add the new components to the appropriate pages (e.g., KernelOverview.tsx, CodeComparisonView.tsx).
  4. Style the components: Use Tailwind CSS for styling to match the existing interface.
  5. Add tests: If applicable, add tests for the new components or functionality.

Testing Frontend Changes

cd website

# Development server
npm run dev

# Type checking
npm run build

# Linting
npm run lint

# Test with sample data
# Load ./public/f0_fc0_a0_cai-.ndjson in browser

πŸ“Š Data Flow

End-to-End Data Flow

Python Code
     β”‚
     β–Ό
Triton Compilation
(triggers Hook Events)
     β”‚
     β–Ό
Structured Logging
     β”‚
     β–Ό
Raw NDJSON Logs
     β”‚
     β–Ό
Log Processing
  - Source Mapping
  - Launch Analysis
     β”‚
     β–Ό
Compressed Data
     β”‚
     β–Ό
Web Interface
     β”‚
     β–Ό
Interactive Visualization

Data Formats

1. Raw NDJSON Format

{
  "event_type": "compilation_start",
  "timestamp": 1234567890,
  "kernel_name": "add_kernel",
  "metadata": {...}
}

2. Processed Format

{
  "kernels": [
    {
      "hash": "abc123",
      "name": "add_kernel",
      "metadata": {...},
      "irFiles": {
        "ttgir": "...",
        "ptx": "..."
      },
      "sourceMappings": {
        "ttgir": {...},
        "ptx": {...}
      }
    }
  ]
}
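A minimal reader for the processed format shown above; the field names are taken from the example, and real payloads contain more fields:

```python
import json

payload = json.loads("""
{
  "kernels": [
    {"hash": "abc123", "name": "add_kernel",
     "metadata": {},
     "irFiles": {"ttgir": "...", "ptx": "..."},
     "sourceMappings": {"ttgir": {}, "ptx": {}}}
  ]
}
""")
for kernel in payload["kernels"]:
    # List which IR stages were captured for each kernel.
    print(kernel["name"], sorted(kernel["irFiles"]))  # add_kernel ['ptx', 'ttgir']
```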

πŸ” Debugging and Development Tools

Debug Logging

# Enable debug logging
export TRITONPARSE_DEBUG=1

# Run with debug output
python your_script.py

Development Utilities

# Check log file contents
head -n 10 ./logs/*.ndjson

# Inspect compressed data
zcat ./parsed_output/*.gz | head -n 20

# Test parsing pipeline
python -c "
import tritonparse.utils
tritonparse.utils.unified_parse('./logs/', './test_output/', verbose=True)
"

Browser Developer Tools

// Enable frontend debug logging
localStorage.setItem('tritonparse-debug', 'true');

// Inspect loaded data
console.log(window.tritonparseData);

// Test data processing
import { processKernelData } from './utils/dataLoader';
console.log(processKernelData(rawData));

πŸ§ͺ Testing

Test Structure

tests/
β”œβ”€β”€ test_tritonparse.py         # Main test suite
β”œβ”€β”€ test_add.py                 # Manual test example
└── example_output/             # Sample data

Running Tests

# All tests
python -m unittest tests.test_tritonparse -v

# CPU-only tests
python -m unittest tests.test_tritonparse.TestTritonparseCPU -v

# GPU tests (requires CUDA)
python -m unittest tests.test_tritonparse.TestTritonparseCUDA -v

# Manual test
cd tests
TORCHINDUCTOR_FX_GRAPH_CACHE=0 python test_add.py

Writing Tests

To add a new end-to-end test case, you should follow the structure of existing tests in TestTritonparseCUDA. The general workflow is as follows:

  1. Define a test method: Create a new method inside TestTritonparseCUDA with a name starting with test_.
  2. Define a Triton kernel: Write a simple Triton kernel that demonstrates the feature you want to test. This can be defined directly inside the test method.
  3. Set up a temporary environment: Use tempfile.mkdtemp() to create temporary directories for logs and parsed output.
  4. Initialize tritonparse logging: Call tritonparse.structured_logging.init() to start capturing events.
  5. Run the kernel: Execute the kernel to generate compilation and launch events. Run it multiple times if you need to test launch_diff functionality.
  6. Parse the logs: Call tritonparse.utils.unified_parse() to process the raw logs.
  7. Assert the results: Check the contents of the raw log files or the parsed output to verify that the behavior is correct.
  8. Clean up: Use a try...finally block to ensure the temporary directory is always removed.

Here is a simplified example illustrating how to add a new test:

# In tests/test_tritonparse.py, inside TestTritonparseCUDA

@unittest.skipUnless(torch.cuda.is_available(), "CUDA not available")
def test_new_feature_workflow(self):
    """Test a new feature in the end-to-end workflow."""

    # 1. Define the kernel for the test
    @triton.jit
    def my_new_kernel(x_ptr, y_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
        # ... kernel implementation ...
        pid = tl.program_id(axis=0)
        offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
        x = tl.load(x_ptr + offsets, mask=offsets < n_elements)
        tl.store(y_ptr + offsets, x, mask=offsets < n_elements)

    # 2. Set up temporary directories
    temp_dir = tempfile.mkdtemp()
    log_path = os.path.join(temp_dir, "logs")
    parsed_path = os.path.join(temp_dir, "parsed")
    os.makedirs(log_path, exist_ok=True)
    os.makedirs(parsed_path, exist_ok=True)

    # 3. Initialize logging and ensure cleanup
    tritonparse.structured_logging.init(log_path, enable_launch_trace=True)
    try:
        # 4. Run the kernel to generate logs
        x = torch.randn(128, device="cuda")
        y = torch.empty_like(x)
        my_new_kernel[(1,)](x, y, 128, BLOCK_SIZE=128)
        torch.cuda.synchronize()

        # 5. Parse the generated logs
        tritonparse.utils.unified_parse(source=log_path, out=parsed_path)

        # 6. Verify the output
        parsed_files = os.listdir(parsed_path)
        self.assertGreater(len(parsed_files), 0, "Parsing did not produce output files.")

        # ... (add more specific assertions on file contents) ...

    finally:
        # 7. Clean up the temporary directory
        shutil.rmtree(temp_dir)
        tritonparse.structured_logging.clear_logging_config()

πŸ“¦ Release Process

Version Management

Versions are managed in:

  • pyproject.toml - Python package version
  • website/package.json - Frontend version

Release Steps

  1. Update version numbers
  2. Update CHANGELOG.md
  3. Run full test suite
  4. Build and test website
  5. Create GitHub release
  6. Deploy to GitHub Pages

GitHub Actions

CI/CD pipeline includes:

  • Format checking - Code style validation
  • Linting - Code quality checks
  • Testing - Python and frontend tests
  • Website deployment - Automatic GitHub Pages deployment

🀝 Contributing Guidelines

Pull Request Process

  1. Fork the repository
  2. Create feature branch: git checkout -b feature-name
  3. Make changes following coding standards
  4. Add tests for new functionality
  5. Run formatting: make format
  6. Run tests: make lint-check && python -m unittest tests.test_tritonparse -v
  7. Submit pull request

Code Review Process

  • All PRs require review by core maintainers
  • CI checks must pass before merge
  • Documentation updates required for new features
  • Tests required for new functionality

Issue Reporting

When reporting issues:

  1. Use issue templates provided
  2. Include system information
  3. Provide reproduction steps
  4. Include error messages and logs

πŸ“š Additional Resources

Documentation

Community

External Resources

πŸ”— Next Steps

For new developers:

  1. Complete the Installation Guide
  2. Read the Usage Guide to understand the tool
  3. Explore the codebase starting with simple components
  4. Run the test suite to verify your setup
  5. Join GitHub Discussions for community support

For experienced contributors:

  1. Check GitHub Issues for open tasks
  2. Review the Architecture Deep Dive for advanced topics
  3. Contribute to documentation improvements
  4. Propose new features through GitHub Discussions