
HGPU group
@hgpu
Followers
3K
Following
104
Media
9
Statuses
10K
High performance computing on graphics processing units (GPU): AMD/ATI, nVidia, Intel Xeon Phi, CUDA, OpenCL, OpenGL, GPGPU, HPC
Joined May 2011
Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems. #CUDA #TaskScheduling #Package.
0
0
1
Demystifying NCCL: An In-depth Analysis of GPU Communication Protocols and Algorithms. #CUDA #GPUcluster #Communication.
0
1
6
KIS-S: A GPU-Aware Kubernetes Inference Simulator with RL-Based Auto-Scaling. #GPU #Kubernets #Package.
0
1
4
Accelerated discovery and design of Fe-Co-Zr magnets with tunable magnetic anisotropy through machine learning and parallel computing. #CUDA #Physics #MaterialsScience #CondensedMatter #MachineLearning #ML #Package.
0
0
1
ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks. #CUDA #OpenMP #LLM #CodeGeneration #Benchmarking #Package.
0
0
1
P4OMP: Retrieval-Augmented Prompting for OpenMP Parallelism in Serial Code. #OpenMP #LLM #HPC #CodeGeneration.
0
0
0
No More Shading Languages: Compiling C++ to Vulkan Shaders. #Vulkan #Compilers #GLSL #Rendering #Raytracing #Package.
0
0
5
GCStack+GCScaler: Fast and Accurate GPU Performance Analyses Using Fine-Grained Stall Cycle Accounting and Interval Analysis. #CUDA #Performance.
0
1
1
Engineering Supercomputing Platforms for Biomolecular Applications. #CUDA #ROCm #Biology #Biomolecules #MolecularDynamics #HPC #Physics #Package.
0
0
0
A Novel Compiler Transformation for Fast Sparse Matrix Multiplication in GPUs. #CUDA #Compilers #Sparse #MatrixMultiplication.
0
0
5
LiteGD: Lightweight and dynamic GPU Dispatching for Large-scale Heterogeneous Clusters. #GPUcluster.
0
0
1