Flash-MoE: Running a 397B Parameter Model on a Laptop
Flash-MoE enables running a massive 397 billion parameter model on consumer laptop hardware through innovative Mixture of Experts architecture and memory optimization techniques. The project demonstra...