Go Concurrency Patterns (Fan-out/Fan-in Pattern)

Overview

The Fan-out/Fan-in pattern is a concurrency pattern that distributes work across multiple workers (fan-out) and then collects results from all workers (fan-in). This pattern is essential for parallel processing of independent tasks, load balancing, and improving throughput for CPU-intensive operations.

NOTE: For other posts on concurrency patterns, check out the index post to this series of concurrency patterns.

Implementation Details

Structure

The fan-out/fan-in implementation in examples/fan.go consists of three main components:

  1. Work Generation – Creates work items to be processed
  2. Fan-out – Distributes work across multiple workers
  3. Fan-in – Collects results from all workers

Code Analysis

func RunFan() {
    // Generate work items
    workItems := generateWorkItems(20)
    
    // Fan out: Distribute work across multiple workers
    numWorkers := 4
    results := fanOut(workItems, numWorkers)
    
    // Fan in: Collect results from all workers
    finalResults := fanIn(results)
    
    // Collect and display results
    for result := range finalResults {
        fmt.Printf("Processed: Item %d -> %s (by Worker %d)\n", 
                   result.OriginalID, result.Processed, result.WorkerID)
    }
}

Code Breakdown:

  1. Work Generation: generateWorkItems(20) creates a channel containing 20 work items. This simulates a real-world scenario where you have a stream of data to process.
  2. Fan-out Phase: fanOut(workItems, numWorkers) launches 4 worker goroutines, each consuming from the same workItems channel. This distributes the workload across multiple concurrent workers.
  3. Fan-in Phase: fanIn(results) takes the individual result channels from each worker and merges them into a single output channel. This consolidates all results for processing.
  4. Result Processing: The final loop consumes results from the merged channel as they become available, displaying which worker processed each item.

Work Item Structure

type WorkItem struct {
    ID   int
    Data string
}

type Result struct {
    OriginalID int
    Processed  string
    WorkerID   int
}

Structure explanation:

  • WorkItem: Represents a single unit of work to be processed
    • ID: Unique identifier for tracking and debugging
    • Data: The actual data to be processed (could be a file path, URL, etc.)
  • Result: Contains the processed output and metadata
    • OriginalID: Links the result back to the original work item
    • Processed: The transformed/processed data
    • WorkerID: Identifies which worker processed this item (useful for debugging and load analysis)

Fan-out Implementation

func fanOut(jobs <-chan WorkItem, numWorkers int) []<-chan Result {
    var workers []chan Result
    var wg sync.WaitGroup
    
    // Create workers
    for i := 0; i < numWorkers; i++ {
        workerResults := make(chan Result)
        workers = append(workers, workerResults)
        
        wg.Add(1)
        go worker(i+1, jobs, workerResults, &wg)
    }
    
    // Close worker result channels when all workers are done
    go func() {
        wg.Wait()
        for _, workerChan := range workers {
            close(workerChan)
        }
    }()
    
    // Convert to read-only channels for return
    var resultChannels []<-chan Result
    for _, ch := range workers {
        resultChannels = append(resultChannels, ch)
    }
    
    return resultChannels
}

Detailed breakdown of fan-out:

  1. Worker Creation Loop:
    • Creates numWorkers individual result channels (one per worker)
    • Each worker gets its own channel to prevent blocking between workers
    • Uses sync.WaitGroup to track when all workers complete
  2. Goroutine Launch:
    • go worker(i+1, jobs, workerResults, &wg) starts each worker concurrently
    • All workers share the same jobs channel (fan-out)
    • Each worker writes to its own workerResults channel
  3. Cleanup Goroutine:
    • Runs in background to wait for all workers to finish
    • Closes each worker’s result channel when all work is complete
    • Prevents deadlocks and ensures proper resource cleanup
  4. Channel Type Conversion:
    • Converts internal chan Result to <-chan Result (read-only)
    • Provides encapsulation – callers can only read from channels
    • Prevents external code from accidentally closing worker channels

Worker Implementation

func worker(id int, jobs <-chan WorkItem, results chan<- Result, wg *sync.WaitGroup) {
    defer wg.Done()
    
    for job := range jobs {
        // Simulate processing work
        time.Sleep(time.Duration(rand.Intn(200)+100) * time.Millisecond)
        
        result := Result{
            OriginalID: job.ID,
            Processed:  fmt.Sprintf("processed-%s-by-worker-%d", job.Data, id),
            WorkerID:   id,
        }
        
        fmt.Printf("Worker %d processed item %d\n", id, job.ID)
        results <- result
    }
}

Worker function breakdown:

  1. Function Signature:
    • id: Unique worker identifier for tracking and debugging
    • jobs <-chan WorkItem: Read-only channel for receiving work
    • results chan<- Result: Write-only channel for sending results
    • wg *sync.WaitGroup: For signaling completion
  2. Resource Management:
    • defer wg.Done() ensures the worker signals completion even if it panics
    • Automatic cleanup when the function exits
  3. Work Processing Loop:
    • for job := range jobs continuously processes jobs until the channel closes
    • Each worker competes for jobs from the shared channel (automatic load balancing)
    • When the jobs channel closes, all workers naturally exit the loop
  4. Simulated Processing:
    • time.Sleep(time.Duration(rand.Intn(200)+100) * time.Millisecond) simulates variable processing time
    • Random delay between 100-300ms mimics real-world processing variance
    • Makes concurrency visible in output (workers process at different speeds)
  5. Result Creation and Sending:
    • Creates a Result struct with original ID, processed data, and worker ID
    • results <- result sends the result to the worker’s dedicated channel
    • Because each worker sends to its own channel, a slow consumer of one channel cannot stall the other workers (each send still blocks until the fan-in side receives, since the channels are unbuffered)

Fan-in Implementation

func fanIn(inputs []<-chan Result) <-chan Result {
    out := make(chan Result)
    var wg sync.WaitGroup
    
    // Function to forward results from one input channel
    forward := func(c <-chan Result) {
        defer wg.Done()
        for result := range c {
            out <- result
        }
    }
    
    wg.Add(len(inputs))
    for _, input := range inputs {
        go forward(input)
    }
    
    // Close output channel when all inputs are done
    go func() {
        wg.Wait()
        close(out)
    }()
    
    return out
}

Fan-in function breakdown:

  1. Setup:
    • Creates a single output channel out that will receive all results
    • Uses sync.WaitGroup to track when all forwarding goroutines complete
  2. Forward Function:
    • forward is a closure that forwards results from one input channel
    • defer wg.Done() ensures proper cleanup
    • for result := range c reads all results from its assigned input channel
    • out <- result forwards each result to the unified output channel
  3. Goroutine Launch:
    • Launches one goroutine per input channel using go forward(input)
    • Each goroutine handles one worker’s results independently
    • Non-blocking – if one worker is slow, others can still forward results
  4. Cleanup Goroutine:
    • Waits for all forwarding goroutines to complete
    • Closes the output channel when all inputs are exhausted
    • Prevents deadlocks and signals to consumers that no more results are coming
  5. Return Value:
    • Returns <-chan Result (read-only) for encapsulation
    • Callers can only consume results, not close the channel

How It Works

  1. Work Generation: Creates a stream of work items to be processed
  2. Fan-out: Launches multiple worker goroutines, each consuming from the same job channel
  3. Worker Processing: Each worker processes jobs independently and sends results to its own result channel
  4. Fan-in: Multiple goroutines forward results from worker channels to a single output channel
  5. Result Collection: The main function consumes all results from the fan-in channel

The pattern enables true parallel processing – multiple workers can process different jobs simultaneously, and results are collected as they complete.

Why This Implementation?

Channel-based Distribution

  • Automatic Load Balancing: Workers naturally consume jobs as they become available
  • Backpressure: If workers are slow, the job channel provides natural backpressure
  • Fair Distribution: All workers have equal access to jobs

Individual Result Channels

  • Isolation: Each worker has its own result channel, preventing blocking
  • Order Independence: Results can be collected in any order
  • Scalability: Easy to add or remove workers

WaitGroup Synchronization

  • Proper Cleanup: Ensures all workers complete before closing channels
  • Resource Management: Prevents goroutine leaks
  • Coordinated Shutdown: All workers finish before fan-in completes

Goroutine-per-Forward

  • Non-blocking Collection: Each worker’s results are forwarded independently
  • Concurrent Collection: Results are collected as soon as they’re available
  • Efficient Resource Usage: No worker blocks waiting for others

Key Design Decisions

  1. Shared Job Channel: All workers read from the same job channel, providing automatic load balancing
  2. Individual Result Channels: Each worker has its own result channel to prevent blocking
  3. Read-only Channel Returns: The fan-out function returns read-only channels for safety
  4. Simulated Processing Time: Random delays simulate real work and make concurrency visible
  5. Structured Results: Results include metadata about which worker processed each item

Common Use Cases

Parallel Data Processing

  • Image Processing: Process multiple images simultaneously
  • Document Processing: Parse, analyze, or transform multiple documents
  • Data Validation: Validate large datasets across multiple workers

API Request Handling

  • Microservice Calls: Make concurrent API calls to multiple services
  • Data Aggregation: Fetch data from multiple sources simultaneously
  • Load Testing: Simulate multiple concurrent users

Batch Processing

  • File Processing: Process multiple files in parallel
  • Database Operations: Execute multiple queries concurrently
  • Report Generation: Generate multiple reports simultaneously

Real-time Systems

  • Sensor Data Processing: Process data from multiple sensors
  • Log Analysis: Analyze logs from multiple sources
  • Monitoring: Collect metrics from multiple systems

Machine Learning

  • Model Training: Train multiple models in parallel
  • Feature Processing: Process features across multiple workers
  • Hyperparameter Tuning: Test multiple parameter combinations

The fan-out/fan-in pattern is particularly effective when you have independent tasks that can be processed in parallel and you want to maximize resource utilization. Note that results arrive in completion order, not submission order; if ordering matters, re-sort them (for example by OriginalID) after collection.

The final example implementation looks like this:

package examples

import (
	"fmt"
	"math/rand"
	"sync"
	"time"
)

// RunFan demonstrates the fan-out/fan-in pattern
func RunFan() {
	fmt.Println("=== Fan-out/Fan-in Pattern Example ===")

	// Generate work items
	workItems := generateWorkItems(20)

	// Fan out: Distribute work across multiple workers
	numWorkers := 4
	results := fanOut(workItems, numWorkers)

	// Fan in: Collect results from all workers
	finalResults := fanIn(results)

	fmt.Printf("Distributing %d work items across %d workers...\n", 20, numWorkers)
	fmt.Println()

	// Collect and display results
	count := 0
	for result := range finalResults {
		fmt.Printf("Processed: Item %d -> %s (by Worker %d)\n", result.OriginalID, result.Processed, result.WorkerID)
		count++
	}

	fmt.Printf("\nFan-out/Fan-in completed! Processed %d items.\n", count)
}

// WorkItem represents a unit of work
type WorkItem struct {
	ID   int
	Data string
}

// Result represents the processed work item
type Result struct {
	OriginalID int
	Processed  string
	WorkerID   int
}

// Generate work items
func generateWorkItems(count int) <-chan WorkItem {
	out := make(chan WorkItem)
	go func() {
		defer close(out)
		for i := 0; i < count; i++ {
			item := WorkItem{
				ID:   i,
				Data: fmt.Sprintf("data-%d", i),
			}
			fmt.Printf("Generated work item: %d\n", i)
			out <- item
			time.Sleep(50 * time.Millisecond)
		}
	}()
	return out
}

// Worker function that processes work items
func worker(id int, jobs <-chan WorkItem, results chan<- Result, wg *sync.WaitGroup) {
	defer wg.Done()

	for job := range jobs {
		// Simulate processing work
		time.Sleep(time.Duration(rand.Intn(200)+100) * time.Millisecond)

		result := Result{
			OriginalID: job.ID,
			Processed:  fmt.Sprintf("processed-%s-by-worker-%d", job.Data, id),
			WorkerID:   id,
		}

		fmt.Printf("Worker %d processed item %d\n", id, job.ID)
		results <- result
	}
}

// Fan out: Distribute work across multiple workers
func fanOut(jobs <-chan WorkItem, numWorkers int) []<-chan Result {
	var workers []chan Result
	var wg sync.WaitGroup

	// Create workers
	for i := 0; i < numWorkers; i++ {
		workerResults := make(chan Result)
		workers = append(workers, workerResults)

		wg.Add(1)
		go worker(i+1, jobs, workerResults, &wg)
	}

	// Close worker result channels when all workers are done
	go func() {
		wg.Wait()
		for _, workerChan := range workers {
			close(workerChan)
		}
	}()

	// Convert to read-only channels for return
	var resultChannels []<-chan Result
	for _, ch := range workers {
		resultChannels = append(resultChannels, ch)
	}

	return resultChannels
}

// Fan in: Collect results from multiple channels
func fanIn(inputs []<-chan Result) <-chan Result {
	out := make(chan Result)
	var wg sync.WaitGroup

	// Function to forward results from one input channel
	forward := func(c <-chan Result) {
		defer wg.Done()
		for result := range c {
			out <- result
		}
	}

	wg.Add(len(inputs))
	for _, input := range inputs {
		go forward(input)
	}

	// Close output channel when all inputs are done
	go func() {
		wg.Wait()
		close(out)
	}()

	return out
}

To run this example and build the code yourself, check out this and other examples in the go-fluency-concurrency-model-patterns repo. That’s it for this topic; tomorrow I’ll post on the worker pools pattern.
