Latest
-
Nanoflow: Boosting LLM Throughput by Nearly 2x
Nanoflow is an innovative dataflow programming model that nearly doubles Large Language Model (LLM) throughput by leveraging intra-device parallelism.
Nanoflow is an innovative dataflow programming model that nearly doubles Large Language Model (LLM) throughput by leveraging intra-device parallelism.