Pranav
Home
Blog
Projects
Experience
Resume
Papershelf
Contact Me
Open Menu
Home
Blog
Projects
Experience
Resume
Contact
Close Menu
Papershelf
A list of research papers, articles & blogs I've enjoyed reading
MapReduce: Simplified Data Processing on Large Clusters
Making it easy to run MapReduce on large cluster of distributed machines
Near-duplicate Question Detection
An algorithm which uses LLMs for near-duplicate question detection
SIEVE is Simpler than LRU
A new, simple & primitive caching strategy to improve Cache performance
Supporting Resources
:
home
Magnet: Push-based Shuffle Service for Large-scale Data Processing
A spark merging mechanism which merges continuous chunks and stores batches remotely to speed up MapReduce operations
Supporting Resources
:
spark shuffling