HGPU group

@hgpu

Followers
3K
Following
104
Media
9
Statuses
10K

High performance computing on graphics processing units (GPU): AMD/ATI, nVidia, Intel Xeon Phi, CUDA, OpenCL, OpenGL, GPGPU, HPC

Joined May 2011
@hgpu
HGPU group
15 hours
Mutual-Supervised Learning for Sequential-to-Parallel Code Translation. #CUDA #HPC #LLM #CodeGeneration #Package.
0
0
1
@hgpu
HGPU group
15 hours
Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems. #CUDA #TaskScheduling #Package.
0
0
1
@hgpu
HGPU group
15 hours
Serving LLMs in HPC Clusters: A Comparative Study of Qualcomm Cloud AI 100 Ultra and High-Performance GPUs. #Qualcomm #Cloud #LLM #HPC #DeepLearning #DL.
2
2
4
@hgpu
HGPU group
15 hours
Demystifying NCCL: An In-depth Analysis of GPU Communication Protocols and Algorithms. #CUDA #GPUcluster #Communication.
0
1
6
@hgpu
HGPU group
15 hours
KIS-S: A GPU-Aware Kubernetes Inference Simulator with RL-Based Auto-Scaling. #GPU #Kubernetes #Package.
0
1
4
@hgpu
HGPU group
8 days
Accelerated discovery and design of Fe-Co-Zr magnets with tunable magnetic anisotropy through machine learning and parallel computing. #CUDA #Physics #MaterialsScience #CondensedMatter #MachineLearning #ML #Package.
0
0
1
@hgpu
HGPU group
8 days
Thesis: Efficient GPU Implementation of Multi-Precision Integer Division. #CUDA #Futhark #Package.
0
0
4
@hgpu
HGPU group
8 days
Libra: Synergizing CUDA and Tensor Cores for High-Performance Sparse Matrix Multiplication. #CUDA #Sparse #SpMM #DeepLearning #DL #Package.
0
0
2
@hgpu
HGPU group
8 days
ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks. #CUDA #OpenMP #LLM #CodeGeneration #Benchmarking #Package.
0
0
1
@hgpu
HGPU group
8 days
P4OMP: Retrieval-Augmented Prompting for OpenMP Parallelism in Serial Code. #OpenMP #LLM #HPC #CodeGeneration.
0
0
0
@hgpu
HGPU group
15 days
No More Shading Languages: Compiling C++ to Vulkan Shaders. #Vulkan #Compilers #GLSL #Rendering #Raytracing #Package.
0
0
5
@hgpu
HGPU group
15 days
GCStack+GCScaler: Fast and Accurate GPU Performance Analyses Using Fine-Grained Stall Cycle Accounting and Interval Analysis. #CUDA #Performance.
0
1
1
@hgpu
HGPU group
15 days
Omniwise: Predicting GPU Kernels Performance with LLMs. #ROCm #LLM #Performance.
0
0
3
@hgpu
HGPU group
15 days
Survey of HPC in US Research Institutions. #HPC #AI.
0
0
2
@hgpu
HGPU group
15 days
WiLLM: An Open Wireless LLM Communication System. #LLM #Package.
0
0
0
@hgpu
HGPU group
22 days
Engineering Supercomputing Platforms for Biomolecular Applications. #CUDA #ROCm #Biology #Biomolecules #MolecularDynamics #HPC #Physics #Package.
0
0
0
@hgpu
HGPU group
22 days
A First Look at Bugs in LLM Inference Engines. #LLM #AI.
0
1
1
@hgpu
HGPU group
22 days
A CPU+FPGA OpenCL Heterogeneous Computing Platform for Multi-Kernel Pipeline. #OpenCL #FPGA.
0
0
6
@hgpu
HGPU group
22 days
A Novel Compiler Transformation for Fast Sparse Matrix Multiplication in GPUs. #CUDA #Compilers #Sparse #MatrixMultiplication.
0
0
5
@hgpu
HGPU group
22 days
LiteGD: Lightweight and dynamic GPU Dispatching for Large-scale Heterogeneous Clusters. #GPUcluster.
0
0
1