#sycl

Amartya<p>My brain is absolutely fried. <br>Today is the last day of coursework submissions for this semester. What a hectic month. <br>DNN with PyTorch, brain model parallelisation with MPI, SYCL and OpenMP offloading of percolation models, hand-optimising serial codes for performance.<br>Two submissions due today. Submitted one and finalising my report for the second one. <br>Definitely having a pint after this.</p><p><a href="https://fosstodon.org/tags/sycl" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>sycl</span></a> <a href="https://fosstodon.org/tags/hpc" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>hpc</span></a> <a href="https://fosstodon.org/tags/msc" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>msc</span></a> <a href="https://fosstodon.org/tags/epcc" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>epcc</span></a> <a href="https://fosstodon.org/tags/cuda" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>cuda</span></a> <a href="https://fosstodon.org/tags/pytorch" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>pytorch</span></a> <a href="https://fosstodon.org/tags/mpi" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>mpi</span></a> <a href="https://fosstodon.org/tags/openmp" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>openmp</span></a> <a href="https://fosstodon.org/tags/hectic" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>hectic</span></a> <a href="https://fosstodon.org/tags/programming" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>programming</span></a> <a href="https://fosstodon.org/tags/parallelprogramming" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>parallelprogramming</span></a> <a href="https://fosstodon.org/tags/latex" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>latex</span></a></p>
pafurijaz<p>It seems that <a href="https://mastodon.social/tags/Vulkan" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Vulkan</span></a> could be the real alternative for using <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> on GPUs or CPUs of any brand, without necessarily having to rely on <a href="https://mastodon.social/tags/CUDA" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>CUDA</span></a> or <a href="https://mastodon.social/tags/AMD" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AMD</span></a>'s <a href="https://mastodon.social/tags/ROCm" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ROCm</span></a>. I thought <a href="https://mastodon.social/tags/SYCL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SYCL</span></a> was the alternative. This might finally free us from the <a href="https://mastodon.social/tags/Nvidia" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Nvidia</span></a> monopoly.<br><a href="https://mastodon.social/tags/Khronos" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Khronos</span></a></p>
Giuseppe Bilotta<p>It's out, if anyone is curious</p><p> <a href="https://doi.org/10.1002/cpe.8313" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">doi.org/10.1002/cpe.8313</span><span class="invisible"></span></a></p><p>This is a “how to” guide. <a href="https://fediscience.org/tags/GPUSPH" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GPUSPH</span></a>, as the name suggests, was designed from the ground up to run on <a href="https://fediscience.org/tags/GPU" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GPU</span></a> (w/ <a href="https://fediscience.org/tags/CUDA" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>CUDA</span></a>, for historical reasons). We wrote a CPU version a long time ago for a publication that required a comparison, but it was never maintained. In 2021, I finally took the plunge, and taking inspiration from <a href="https://fediscience.org/tags/SYCL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SYCL</span></a>, adapted the device code in functor form, so that it could be “trivially” compiled for CPU as well.</p><p><a href="https://fediscience.org/tags/HPC" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>HPC</span></a> <a href="https://fediscience.org/tags/GPGPU" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GPGPU</span></a></p>
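The "functor form" mentioned in the post above can be illustrated with a minimal sketch. This is not GPUSPH's code: `SaxpyKernel`, `launch_on_cpu`, and `saxpy_on_cpu` are hypothetical names invented here, and the example only assumes the general idea that a kernel body written as a callable object can be driven by a plain CPU loop instead of a GPU launch.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical kernel body written as a functor: everything it needs is
// stored in members, and operator() processes exactly one element.
struct SaxpyKernel {
    float a;
    const float* x;
    float* y;

    // In a CUDA build this operator would additionally be marked __device__.
    void operator()(std::size_t i) const { y[i] = a * x[i] + y[i]; }
};

// Hypothetical CPU "launcher": a plain loop stands in for the GPU grid,
// calling the functor once per element index.
template <typename Kernel>
void launch_on_cpu(const Kernel& kernel, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i) {
        kernel(i);
    }
}

// Illustration-only wrapper: computes a * x + y element-wise on the CPU
// and returns the updated copy of y.
std::vector<float> saxpy_on_cpu(float a, const std::vector<float>& x,
                                std::vector<float> y) {
    assert(x.size() == y.size());
    launch_on_cpu(SaxpyKernel{a, x.data(), y.data()}, y.size());
    return y;
}
```

Because the per-element logic lives entirely in the functor, only the launcher would need to change between a CPU and a GPU build; how a real code base handles memory transfers and thread indexing is outside this sketch.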
Brett Edmond Carlock<p>Do I have anyone in my wider network with skills in programming CUDA, SYCL, and OpenCL?</p><p>We want to determine feasibility of migrating CUDA-only code to SYCL (via SYCLomatic?): OpenCV feature detection/extraction modules (SIFT, HAGOG, ORB, AKAZE).</p><p>The intent is to upstream all feasible work. </p><p>This, hopefully, should stand to benefit everyone instead of being limited to NVIDIA.</p><p>Currently in info gathering/people connecting phase, not yet funded &amp; ready to go.</p><p><a href="https://mastodon.online/tags/CUDA" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>CUDA</span></a> <a href="https://mastodon.online/tags/SYCL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SYCL</span></a> <a href="https://mastodon.online/tags/OpenCL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OpenCL</span></a> <a href="https://mastodon.online/tags/OpenCV" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OpenCV</span></a></p>
FCLC<p>Time for an <a href="https://mast.hpc.social/tags/introduction" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>introduction</span></a>! <br>I'm a young Canuck with interests/experience in <a href="https://mast.hpc.social/tags/HPC" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>HPC</span></a>, <a href="https://mast.hpc.social/tags/Linux" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Linux</span></a>, <a href="https://mast.hpc.social/tags/BLAS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BLAS</span></a>, <a href="https://mast.hpc.social/tags/SYCL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SYCL</span></a>, <a href="https://mast.hpc.social/tags/C" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>C</span></a>, <a href="https://mast.hpc.social/tags/AVX512" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AVX512</span></a>, <a href="https://mast.hpc.social/tags/Rust" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Rust</span></a>, heterogeneous compute &amp; other such things. </p><p>Currently my personal projects are bringing <a href="https://mast.hpc.social/tags/FP16" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>FP16</span></a> to the <a href="https://mast.hpc.social/tags/OpenBLAS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OpenBLAS</span></a> library, working to standardize what Complex domain BLAS FP16 kernels/implementations should look like, and making sure <a href="https://mast.hpc.social/tags/SYCL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SYCL</span></a> is available everywhere. </p><p>I also write every now and again. 
Here's the tale of AVX512 FP16 on Alder Lake <br><a href="https://gist.github.com/FCLC/56e4b3f4a4d98cfd274d1430fabb9458" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">gist.github.com/FCLC/56e4b3f4a</span><span class="invisible">4d98cfd274d1430fabb9458</span></a></p>