Abstract: The Zebra Optimization Algorithm (ZOA) is a newly proposed biology-based metaheuristic method drawn from zebras in nature, designed to tackle optimization problems. As with other ...
Implementation of "Breaking the Low-Rank Dilemma of Linear Attention" The Softmax attention mechanism in Transformer models is notoriously computationally expensive, particularly due to its quadratic ...
Abstract: Stochastic optimization algorithms are widely used to solve large-scale machine learning problems. However, their theoretical analysis necessitates access to unbiased estimates of the true ...
Kimi-K2-Mini is an experimental compressed version of the 1.07T parameter Kimi-K2 model, targeting ~32.5B parameters for more accessible deployment. This project explores several optimization ...