Brian Won (Injong)

Brian Won

Computer Science student at University of Toronto specializing in focused on ml inference and low-level system optimization

About

Hey! I'm Brian; cs grad from UofT. I'm interested in ml inference and low-level network system.

My focus is on inference optimization to large-scale ML infrastructure with with strong interest in understanding how technical performance impacts AI system reliability and user experience.

Technical Skills

Languages

  • Python (OOP, multithreading)
  • C/C++
  • Java
  • JavaScript
  • SQL
  • HTML/CSS

Frameworks & Tools

  • React, Next.js
  • Express, Node.js
  • Git, Docker
  • REST APIs
  • Linux, XML

Skill

  • Deep learning Inference
  • Latency Optimization
  • Distributed Systems
  • Linear Programming
  • Network Security

Featured Projects

EdgeRAG

Interactive Demo Available
  • Actively developing an edge-optimized RAG system for financial analysis, focusing on retrieval accuracy, latency, and response grounding.
  • Iterating on document ingestion and embedding strategies, including chunking heuristics and metadata-aware retrieval for financial text.
  • Exploring evaluation methodologies for RAG systems by comparing standalone LLM outputs against retrieval-augmented responses for factual consistency.
  • Experimenting with deployment constraints at the edge, including memory usage, model size trade-offs, and inference performance.
  • Investigating future extensions such as real-time data integration, hybrid dense–sparse retrieval, and robust prompt orchestration.
Python Vector DB Retrieval-Augmented-Generation FastAPI React
Try Demo GitHub

Experience

2025

University of Toronto

Developed concurrent user contest platform for nationwide programming challenges, improving platform scalability.

2024

Vector Institute

Research Assistant focusing on networked ML inference, optimizing distributed systems for lower noise and latency.

2023

RBC Capital Markets

Developed latency optimization tools for trading infrastructure using stochastic modeling and data-driven analytics.

2019-2020

IBM Watson

Enhanced integration testing platform for the DB2 team, improving reliability and coverage of automated test pipelines.

Teaching Assistant
University of Toronto
Multiple Terms
  • CSC165: Mathematical Foundations for Computer Science
  • CSC343: Database Management Systems
  • CSC2209: Networking Systems (Graduate Network-ML)
  • Network Algorithms, Operating Systems

Education

Bachelor of Science in Computer Science

University of Toronto • Class of 2025

Specialization: ML & Systems

Role: Teaching Assistant for Network Algorithms, Operating Systems, and other courses