This research investigates HiFloat4 precision for pre-training large language models on Huawei Ascend NPUs, demonstrating significant efficiency gains over M...
Level: advanced
By Mehran Taghian
Category: research