SmartNews
Interests
Topics
Top Domains
History
About
Topics
MLC-LLM
Hackernews posts about MLC-LLM
MLC-LLM
: GPT/Llama on consumer-class GPUs and phones
(github.com)
303 points by
junrushao1994
over 1 year ago
|
106 comments
Benchmarking LLM Inference Back Ends: VLLM, LMDeploy,
MLC-LLM
, TensorRT-LLM, TGI
(www.bentoml.com)
15 points by
chaoyu
5 months ago
|
1 comments
MLC LLM
: 70B Llama-2-4bit on MacBook at 50%-80% speed of A100
(twitter.com)
12 points by
junrushao1994
over 1 year ago
|
3 comments
Benchmarking LLM Inference Back Ends: VLLM, LMDeploy,
MLC-LLM
, TRT-LLM, and TGI
(bentoml.com)
12 points by
sherlockxu
6 months ago
|
2 comments
MLC LLM
: Universal LLM Deployment with GPU Acceleration
(github.com)
3 points by
crowwork
over 1 year ago
|
1 comments
Comparing LLM Optimization Tools: VLLM, LMDeploy,
MLC-LLM
, TensorRT-LLM, and TGI
(www.bentoml.com)
2 points by
bbzjk7
6 months ago
|
discuss
MLC LLM
(mlc.ai)
2 points by
tosh
over 1 year ago
|
discuss
MLC LLM
– Large Language Models on iPhone GPU and Many More GPU Platforms
(mlc.ai)
2 points by
crowwork
over 1 year ago
|
discuss
MLC LLM
: Universal Language Model Deployment Across Diverse Hardware and Apps
(llm.mlc.ai)
1 points by
georgehill
11 months ago
|
discuss
Show HN: The fastest way to run Mixtral 8x7B on Apple Silicon Macs
18 points by
woadwarrior01
8 months ago
|
22 comments
Ask HN: What open source LLM and diffusion projects do you rely on?
5 points by
tikkun
about 1 year ago
|
1 comments
Vicuna on iPhone
(mlc.ai)
90 points by
tosh
over 1 year ago
|
15 comments
Bringing Hardware Accelerated Language Models to Android Devices
(github.com)
2 points by
crowwork
over 1 year ago
|
discuss