📋 About
We optimize you neural network training and inference pipelines for your target hardware.
We are specialized in Nvidia GPUs and libraries: CUDA, CuBLAS, CuTLASS, CuDNN, CuTe, NCCL, NVSHMEM. However, we can also work on other platforms and DSLs on-request.
We are an experienced team in quantization, pruning, distillation, distributed systems, parallelization, sharding, compilers and custom kernels such as Flash-Attention variants.
We also train custom models using your proprietary data while maintaini Load more
We are specialized in Nvidia GPUs and libraries: CUDA, CuBLAS, CuTLASS, CuDNN, CuTe, NCCL, NVSHMEM. However, we can also work on other platforms and DSLs on-request.
We are an experienced team in quantization, pruning, distillation, distributed systems, parallelization, sharding, compilers and custom kernels such as Flash-Attention variants.
We also train custom models using your proprietary data while maintaini Load more
×
Login to post a comment
No comments yet. Be the first to share your thoughts!
Login to write a review
No reviews yet. Be the first to review this product!