    * Native matrices now determine indices for data transfer at assembly time
    * Changed nonblocking communication to use Waitany
    * Moved local computation to end
    * Updated NNZ to be compatible with PETSc o_nnz/d_nnz parameters
    * Resolved CI Python version mismatch
    * Fixed a bug where Native matrix initialization allocated too much memory
