Skip to content

GitLab

  • Menu
Projects Groups Snippets
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
  • Sign in
  • A ASGarD
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
    • Locked Files
  • Issues 4
    • Issues 4
    • List
    • Boards
    • Service Desk
    • Milestones
    • Iterations
  • Merge requests 0
    • Merge requests 0
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Monitor
    • Monitor
    • Incidents
  • Packages & Registries
    • Packages & Registries
    • Container Registry
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Code review
    • Issue
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • FK6D
  • ASGarD
  • Issues
  • #11

Closed
Open
Created Apr 03, 2018 by Green, David@dg6Owner

GPU parallelization scheme confusion

After looking at the ApplyA function within https://code.ornl.gov/lmm/DG-SparseGrid/blob/master/Vlasov-Poisson-version2/TimeAdvance.m in @lmm 's code, I'm a touch concerned about how this gets parallelized on the GPU. My understanding is that we want each thread to do the same thing, but here, if we were to parallelize over rows (DOF), then due to the connectivity of each row being different (i.e., the number of nonzeros in each row is different), then each thread would have a different amount of work. I may be a step behind here and @elwasif and @atj and @e6d have already come up with a GPU friendly approach to parallelizing the application of the matrix-vector multiply for a sparse matrix like ours. Perhaps someone can educate me - or is the answer as simple as parallelizing over all the elements within A, not just rows?

To upload designs, you'll need to enable LFS and have an admin enable hashed storage. More information
Assignee
Assign to
Time tracking