News

High Performance Linpack: Version 1.0.1 Release

Added by Matthias Bach almost 7 years ago

Version 1.0.1 is a maintenance release that cleans up version 1.0.0 and fixes some issues found in certain system setups. Most changes are improvements to the build system and its documentation.

You can download the tarball from the files section or get the source via tag v1.0.1 from the [[git repository]]. Installation instructions can be found in the [[wiki]]. For more in-depth information, have a look at the Technical Report that was published along with CALDGEMM 1.0.0.

High Performance Linpack: Version 1.0.0 Release

Added by Matthias Bach almost 7 years ago

We are pleased to announce the release of [[100|HPL-GPU 1.0]]. This largely rewritten version of Linpack can sit on top of CALDGEMM, our high performance DGEMM library for AMD GPUs.

This is the code version that reached 285 Tflops on LOEWE-CSC, pushing the system to position 22 in this fall's Top500 list.

Besides support for GPUs, this version also features massive improvements to single-process-per-node performance, initialization performance, and lookahead. It also includes many smaller improvements that help with tasks such as debugging the code and the system.

You can download the tarball from the files section or get the source via tag v1.0.0 from the [[git repository]]. Installation instructions can be found in the [[wiki]]. For more in-depth information, have a look at the Technical Report that was published along with CALDGEMM 1.0.0.

CALDGEMM: Version 1.0.0 Code and Documentation Release

Added by Matthias Bach almost 7 years ago

We are happy to be able to release version 1.0.0 of CALDGEMM along with a Technical Report describing its inner workings and how we integrated the library into a modified version of the High Performance Linpack. You can get the tarball from the files section of the website or clone the [[git repository]].

CALDGEMM allows you to use AMD GPUs to achieve the highest DGEMM performance possible on current hardware, basically reaching peak performance. It utilizes GotoBLAS for the CPU side of combined CPU/GPU computation.
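For readers unfamiliar with the operation, DGEMM is the double-precision general matrix multiply C = alpha * A * B + beta * C. The following is a minimal, unoptimized reference sketch in plain C. It is not CALDGEMM's API or implementation; the function name `dgemm_ref` and the row-major layout are assumptions made for illustration only.

```c
#include <stddef.h>

/* Hypothetical reference routine, not part of CALDGEMM.
 * Computes C = alpha * A * B + beta * C for row-major matrices:
 * A is m x k, B is k x n, C is m x n. */
void dgemm_ref(size_t m, size_t n, size_t k,
               double alpha, const double *a, const double *b,
               double beta, double *c)
{
    for (size_t i = 0; i < m; ++i) {
        for (size_t j = 0; j < n; ++j) {
            double acc = 0.0;
            /* Dot product of row i of A with column j of B. */
            for (size_t p = 0; p < k; ++p)
                acc += a[i * k + p] * b[p * n + j];
            c[i * n + j] = alpha * acc + beta * c[i * n + j];
        }
    }
}
```

The triple loop performs roughly 2 * m * n * k floating-point operations, which is the count that flops-based performance figures for DGEMM are normally derived from. Optimized libraries compute the same result but block the work to fit caches and, in the case described here, split it between CPU and GPU.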

Vc: Vc 0.5 Alpha 1 Released

Added by Matthias Kretz almost 7 years ago

In preparation for stabilizing the API, the 0.5 release series paves the way for Vc 1.0.

Compared to the 0.4 series, 0.5 cleans up the API further and introduces the following features:
  • prefetch API
  • simple malloc API for alignments
  • much improved and more flexible load/store API
  • ready to use cmake modules
  • more robust internal aliasing implementation
  • faster short_v division
  • improved compatibility with compilers and assemblers
Release Notes:
  • The Mac OS assembler detection currently does not work correctly, though the outcome should still be correct.
  • There have been reports from Fedora systems that compilation fails. This was not reproducible with vanilla GCC installations of any version.
  • GCC 4.5.1 reports an internal compiler error: http://gcc.gnu.org/bugzilla/show_bug.cgi?id=46723

Download:
Vc-0.4.80.tar.gz

Vc: Vc 0.4 Released

Added by Matthias Kretz almost 7 years ago

After 0.4 spent a long time in beta, receiving little feedback but a few important backports from master, I am releasing 0.4.0 today.

Release Notes:
This release should work with GCC 4.x.x and even older binutils releases. Other compilers are known to cause problems, but please report any problems you encounter nevertheless.

Download:
Vc-0.4.0.tar.gz

High Performance Linpack: Release Approaching

Added by Matthias Bach almost 7 years ago

We are currently sorting out the last documentation and licensing issues. Once that is done, which will be before Christmas, the code will be published along with some technical documentation.

CALDGEMM: Release Approaching

Added by Matthias Bach almost 7 years ago

We are currently sorting out the last documentation and licensing issues. Once that is done, which will be before Christmas, the code will be published along with some technical documentation.
