bolha.us

0 posts0 participants0 posts today

Christos ArgyropoulosQuestion for the <a href="https://mast.hpc.social/tags/rstats" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#rstats</a> crowd. Do you disable hyperthreads when you run analyses in R with a multithreaded version of <a href="https://mast.hpc.social/tags/blas" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#blas</a> e.g. <a href="https://mast.hpc.social/tags/openblas" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#openblas</a> <a href="https://mast.hpc.social/tags/mkl" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#mkl</a> etc ?

Christos Argyropoulos MD, PhD, FASN 🇺🇸Question for the <a class="hashtag" href="https://bsky.app/search?q=%23rstats" rel="nofollow noopener noreferrer" target="_blank">#rstats</a> crowd. Do you disable hyperthreads when you run analyses in R with a multithreaded version of <a class="hashtag" href="https://bsky.app/search?q=%23blas" rel="nofollow noopener noreferrer" target="_blank">#blas</a> e.g. <a class="hashtag" href="https://bsky.app/search?q=%23openblas" rel="nofollow noopener noreferrer" target="_blank">#openblas</a> <a class="hashtag" href="https://bsky.app/search?q=%23mkl" rel="nofollow noopener noreferrer" target="_blank">#mkl</a> etc ?

Christos Argyropoulos5 of these methods can leverage multithreaded (MT) <a href="https://mast.hpc.social/tags/BLAS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#BLAS</a> with a sweet spot ~ 6 threads for the 40% of the time spent in MT regions. E5-2697 has 36/72 (physical/logical) cores, so the avg case scenario is one in which 0.4x3x6 cores +2 (serial methods) tie up ~ 9.2 cores ~13% of the 72 logical cores. So far the back of envelope calculation, i.e. if I run 5 out of the 2100 design points in parallel, I will stay within 15% of resource use is holding rather well! <a href="https://mast.hpc.social/tags/benchmarking" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#benchmarking</a> <a href="https://mast.hpc.social/tags/hpc" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#hpc</a> <a href="https://mast.hpc.social/tags/rstats" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#rstats</a>

Christos Argyropoulos MD, PhDMultiple cores to the rescue as I am using a custom D-optimal design to benchmark memory/CPU utilization of 7 alternative implementations of frailty models for big data from <a href="https://mstdn.science/tags/EHRs" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#EHRs</a>. By limiting the number of models that are run simultaneously in <a href="https://mstdn.science/tags/rstats" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#rstats</a> to use < 30% of CPU one can treat concurrent runs as independent when evaluating the sweet spot of <a href="https://mstdn.science/tags/BLAS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#BLAS</a> threads (6) for these methods

Christos ArgyropoulosMultiple cores to the rescue as I am using a custom D-optimal design to benchmark memory/CPU utilization of 7 alternative implementations of frailty models for big data from <a href="https://mast.hpc.social/tags/EHRs" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#EHRs</a>. By limiting the number of models that are run simultaneously in <a href="https://mast.hpc.social/tags/rstats" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#rstats</a> to use < 30% of CPU one can treat concurrent runs as independent when evaluating the sweet spot of <a href="https://mast.hpc.social/tags/BLAS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#BLAS</a> threads (6) for these methods

FCLCSimple question: what is your *default* BLAS package? <a href="https://mast.hpc.social/tags/HPC" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#HPC</a> <a href="https://mast.hpc.social/tags/BLAS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#BLAS</a>

FCLCTime for an <a href="https://mast.hpc.social/tags/introduction" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#introduction</a>! I'm a young Canuck with interests/experience in <a href="https://mast.hpc.social/tags/HPC" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#HPC</a>, <a href="https://mast.hpc.social/tags/Linux" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#Linux</a>, <a href="https://mast.hpc.social/tags/BLAS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#BLAS</a>, <a href="https://mast.hpc.social/tags/SYCL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#SYCL</a>, <a href="https://mast.hpc.social/tags/C" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#C</a>, <a href="https://mast.hpc.social/tags/AVX512" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#AVX512</a>, <a href="https://mast.hpc.social/tags/Rust" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#Rust</a>, heterogeneous compute & other such things. Currently my personal projects are bringing <a href="https://mast.hpc.social/tags/FP16" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#FP16</a> to the <a href="https://mast.hpc.social/tags/OpenBLAS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#OpenBLAS</a> library, working to standardize what Complex domain BLAS FP16 kernels/implementations should look like, and making sure <a href="https://mast.hpc.social/tags/SYCL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#SYCL</a> is available everywhere. I also write every now and again. Here's the tail of AVX512 FP16 on Alderlake <a href="https://gist.github.com/FCLC/56e4b3f4a4d98cfd274d1430fabb9458" rel="nofollow noopener noreferrer" translate="no" target="_blank">https://gist.github.com/FCLC/56e4b3f4a4d98cfd274d1430fabb9458</a>

Recent searches

Search options

Administered by:

Server stats:

#blas