Hi, I'm Yue Jiang.
Programmer, System Builder, Cat Lover
A passionate computer system developer. I’m excited to build high-performance large-scale systems.
A passionate computer system developer. I’m excited to build high-performance large-scale systems.
I worked on Ingest Actions: a key feature for data masking, filtering, and routing.
I worked on OpsAdapter, a MarketData & Trading Interfaces System for trading and supporting the quantitative research team.
Hubble, a Data Management System:
Sisyphus, a Data Transfer System:
Courseworks
Honors
Courseworks
Implemented a hybrid file system with a local SSD and a cloud storage service similar to Amazon S3, maintaining small objects as well as metadata on the SSD and large objects on the cloud storage.
Applied block-level deduplication based on Rabin Fingerprinting to reduce cloud storage cost by around 50%.
Supported generation of consistent CloudFS snapshots (backups) which support restoring the file system to a previous state or altering the state of the file system.
Leveraged the spare capacity on the SSD as a cache for cloud backed data based on Least Recently Used (LRU) policy. This improved performance and further reduced cloud service costs from average $37 to $18.59 under 10 workloads.
Implemented a buffer pool manager responsible for moving physical pages back and forth from main memory to disk, supporting databases that are larger than the amount of memory available to the system.
Implemented a concurrent disk-backed hash table with extendible hashing scheme for fast data retrieval.
Designed a hybrid log-block mapping scheme to translate read/write requests on logical blocks into low-level operations on SSDs, and garbage collection policies such as LRU and Cost-Benefit proposed in log-structured file system.
Proposed a dynamic mapping scheme inspired by log-structured file systems and a Cost-Endurance-Benefit garbage collection policy, optimizing the performance in wear-leveling by 10x and decreasing write amplification rate by 30%.
Built an RPC system to allow concurrent remote file operations based on TCP.
Designed a check-on-use caching protocol for a distributed file system using session semantics.
Developed distributed transactions with two-phase commit, utilizing logging to persistent storage for failure recovery.