HiFloat4 Format for Language Model Pre-training on Ascend NPUs

This research investigates HiFloat4 precision for pre-training large language models on Huawei Ascend NPUs, demonstrating significant efficiency gains over M...

Level: advanced

By Mehran Taghian

Category: research