[GTALUG] Running Dell branded Nvidia gtx 1060 in non-dell system

Wed Aug 7 11:22:36 EDT 2019

On Wed, Aug 7, 2019 at 9:54 AM D. Hugh Redelmeier via talk
<talk at gtalug.org> wrote:
>
> | From: xerofoify via talk <talk at gtalug.org>
>
> | On Tue, Aug 6, 2019 at 4:23 PM Alex Volkov via talk <talk at gtalug.org> wrote:
>
> | I don't know how much you're intending to do with that GPU or otherwise.
>
> He said he wants to do human (himself) learning about machine
> learning.  Less cute way of saying this: he wants to experiment and
> play with ML.
>
> | If you're just using Nvidia I can't help
> | you, as mentioned, but if you're interested in GPU workloads, I was looking
> | at the AMDGPU backend for LLVM.
>
> Most people learning (or doing) ML pick a tall stack of software and
> then learn almost nothing about the underlying hardware.  I admit
> that sometimes performance issues poke their way through those levels.
>
> If I remember correctly, the base of the stack that Alex was playing
> with was Google's TensorFlow.  Of course there is stuff below that but
> the less he has to know about it the better.
>
> See Alex's discussion about getting TensorFlow to work on AMD.  If I
> understood him correctly (maybe not), the normal route to TensorFlow on
> AMD is through ROCm, and that won't work on an APU.  Too bad.
>
> My guess: even if he could run ROCm, he might hit some hangups with
> TensorFlow since the most used path to TensorFlow is Nvidia cards and
> (I think) CUDA.  It's always easier to follow a well-worn path.
>
> I, on the other hand, think I'm interested in the raw hardware.  I
> have not put any time into this but I intend to (one of many things I
> want to do).
>
> | Not sure if there is one that targets Nvidia cards but it may be of
> | interest to you as you would be able to
> | compile directly for the GPU rather than using an API to access it.
> | Not sure about Nvidia so double check
> | that.
>
> As I understand it:
>
> - LLVM targets raw AMD GPU hardware
>
> - that's probably not very useful because runtime support is needed
>   for what you could consider OS-like functionality and that isn't
>   provided.
>
>   + scheduling
>
>   + communicating with the host computer
>
>   + failure handling and diagnostics
>
> - Separately, a custom version of LLVM is used as a code generator
>   (partly(?) at runtime!) for OpenCL.  I think that AMD tries to
>   "upstream" their LLVM changes but this is never soon enough.
>
> - I think that Nvidia also has a custom LLVM but does not try to
>   upstream all of their goodies (LLVM is not copylefted).
>
> I may be wrong about much or all of this.  I would like to know an
> accurate, comprehensive, comprehensible source for this kind of
> information.
>
> | Here is the official documentation for AMD though:
> | https://llvm.org/docs/AMDGPUUsage.html
>
> Thanks.  I'll have a look.
>
> | If you're using it for machine learning it may be helpful to be aware of
> | it
>
> You'd think so but few seem to bother.  There's enough to get one's
> head around at the higher levels of abstraction.
>
> Much ML seems to be done via cook-books.
>
> | Hopefully that helps a little,
>
> I'd love to hear a GTALUG talk about the lower levels.  Perhaps a
> lightning talk next week would be a good place to start.
> ---
I won't be giving a talk about this, but Alex and I were planning for
me to do some sort of Q and A session in the fall, mostly related to
this or other compiler questions. I've found a lot of people have questions,
especially about optimizations and backends. Not sure what month yet,
but this will give you a heads up.

AMD's GPU driver stack is open source, and OpenCL is in both compilers
these days. The biggest difference is that AMD cards have a backend for
targeting the actual hardware. Nvidia, on the other hand, doesn't, or at
least doesn't have an open source one. So you can compile to native assembly
for AMD GPUs after a certain release. The guide I posted mentions it.
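
For what it's worth, here is a rough sketch of what compiling straight to
AMD GPU assembly can look like. It only builds and prints a clang command
line for an OpenCL C file. The file names kernel.cl and kernel.s, the helper
name amdgpu_asm_command, and the gfx900 processor are my assumptions for
illustration; check the AMDGPU usage guide for the processor name that
matches your card and the flags your clang version expects.

```python
def amdgpu_asm_command(source, output, mcpu="gfx900", clang="clang"):
    """Build a clang argv that compiles an OpenCL C file to AMDGPU assembly.

    Hypothetical helper: the defaults (gfx900, "clang" on PATH) are
    assumptions, not something the guide prescribes.
    """
    return [
        clang,
        "-x", "cl",                            # treat the input as OpenCL C
        "-Xclang", "-finclude-default-header", # declare the OpenCL builtins
        "-target", "amdgcn-amd-amdhsa",        # AMD GCN target triple
        "-mcpu=" + mcpu,                       # which GPU generation to target
        "-S",                                  # stop at assembly, do not link
        "-o", output,
        source,
    ]


if __name__ == "__main__":
    # Print the command instead of running it, so this works without clang.
    print(" ".join(amdgpu_asm_command("kernel.cl", "kernel.s")))
```

If clang is installed with the AMDGPU backend enabled, the printed command
can be pasted into a shell or handed to subprocess.run. Depending on the
clang version it may also want the ROCm device libraries, so treat this as
a starting point only.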

If you're really interested, the reference manuals for the GPUs themselves
are at the bottom of this page. They may help you with debugging if you're
going to play around with one:
https://developer.amd.com/resources/developer-guides-manuals/

That should be enough for now,
Nick
