News

About Tencent Hunyuan A13B (Hunyuan-A13B for short), an innovative open-source LLM built on a fine-grained MoE architecture.
The Internet of Things (IoT) in sixth-generation (6G) networks is envisioned to evolve towards intelligence, ubiquity, and self-optimization. Large language models (LLMs) have demonstrated ...
Goal: support a set of custom ops for common operators (attention, cached attention, MLA, RoPE, MoE, ...) that are implemented purely in PyTorch ("torch op backend") and can serve as "golden reference ...