# No-Storage implementation

###### tags: `functional cycle 9` `storage`

developer: Linus (half cycle)

## ToDo Shaping:

* What about arrays that violate the layout expected by GT backends: do we simply not enforce it, or continue to raise?

## Description

We propose to replace the storage interface in cartesian gt4py with a new approach that does not rely on custom types but rather leverages existing standards and only adds a minimal custom interface where necessary. The new interface will be shared among cartesian and unstructured gt4py. The full specification can be found in [this document](https://hackmd.io/kl_H-FZ5SvirNp_28qQI5Q), which is also to be finalized during the cycle. The key points are:

* There are no Storage classes in gt4py.
* There is no mechanism in gt4py for copying data from CPU to GPU or vice-versa.
* We rely on existing standard interfaces to expose buffers for use in stencils rather than creating our own classes or interfaces.
* We do provide utilities `zeros`, `ones`, `full` and `empty` and their corresponding `{}_like` counterparts to allocate arrays that have an appropriate layout and alignment. The signature of those will be similar to those proposed in GDP-3. In case of GPU backends, the result is a CuPy ndarray, for CPU a NumPy ndarray.
* The semantic meaning of dimensions of arrays passed to stencils is looked up according to
    1. the `dims` attribute of the array if present
    2. the annotation of the corresponding argument to the stencil
* While any type implementing the standard interfaces is understood by the stencils, an array can still be incompatible, e.g. due to its layout not being supported in the backend.

## Tasks

For this cycle, we propose to remove the existing storage and allocation facilities and implement the allocation utilities in a common location shared between cartesian and functional versions, similar to `eve`.

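
As a rough illustration of the allocation utilities described above, the following CPU-only sketch returns a plain NumPy ndarray whose first element is aligned and whose strides follow a requested axis order. The parameter names `axis_order` and `alignment_bytes` are assumptions made for this example; the actual signatures are expected to follow GDP-3, and a GPU variant would return a CuPy ndarray instead.

```python
import math

import numpy as np


def empty(shape, dtype=np.float64, *, axis_order=None, alignment_bytes=32):
    """Hypothetical sketch: uninitialized, aligned NumPy array.

    axis_order lists the logical axes from slowest- to fastest-varying
    in memory (assumed parameter, not the final gt4py signature).
    """
    dtype = np.dtype(dtype)
    shape = tuple(shape)
    ndim = len(shape)
    if axis_order is None:
        axis_order = tuple(range(ndim))  # plain C order
    axis_order = tuple(axis_order)
    if sorted(axis_order) != list(range(ndim)):
        raise ValueError("axis_order must be a permutation of the axes")

    # Over-allocate raw bytes, then skip ahead to the next aligned address.
    nbytes = math.prod(shape) * dtype.itemsize
    raw = np.empty(nbytes + alignment_bytes, dtype=np.uint8)
    offset = -raw.ctypes.data % alignment_bytes
    flat = raw[offset:offset + nbytes].view(dtype)

    # Lay the data out C-contiguously in memory order, then permute the
    # axes back so that the result is indexed in logical order.
    memory_shape = tuple(shape[axis] for axis in axis_order)
    inverse = [axis_order.index(axis) for axis in range(ndim)]
    return flat.reshape(memory_shape).transpose(inverse)


def full(shape, fill_value, dtype=np.float64, **kwargs):
    """Hypothetical sketch: like `empty`, but filled with `fill_value`."""
    array = empty(shape, dtype, **kwargs)
    array[...] = fill_value
    return array


def zeros(shape, dtype=np.float64, **kwargs):
    return full(shape, 0, dtype, **kwargs)


def ones(shape, dtype=np.float64, **kwargs):
    return full(shape, 1, dtype, **kwargs)
```

Because the result is an ordinary ndarray view, it can be passed to any code that accepts NumPy arrays; the layout is only visible through its strides.
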
Since work on the bindings is expected to proceed separately, we propose to change the StencilObject of cartesian GT4Py to support the new interface, while leaving support in functional to the bindings project.

* Remove all current storage classes.
* Move allocation routines to a common repository with functional.
* Change the return type of the allocation routines to directly return NumPy or CuPy ndarrays.
* Change the allocation routine signature (starting point can be [gronerl/gdp-3-implementation](https://github.com/gronerl/gt4py/tree/gdp-3-implementation)).
* Adapt StencilObject to work with arbitrary buffers.
* Implement `dims` lookup, adapt `default_origin` lookup in cartesian GT4Py (starting point can be [gronerl/gdp-3-implementation](https://github.com/gronerl/gt4py/tree/gdp-3-implementation)).
* Finalize [Storage.md](https://hackmd.io/kl_H-FZ5SvirNp_28qQI5Q) and add it to the [gt4py concepts wiki](https://github.com/GridTools/concepts/wiki).
* Adapt existing tests and demo scripts to work without storages.

## No-Goals

Performance is not a primary concern in extracting buffers. We expect to be able to rely heavily on NumPy's and CuPy's `asarray` and `from_dlpack` routines. Should those prove to introduce too much overhead, a more direct interface can be introduced later.

## Appetite

As allocation per se is already solved, this work consists largely of removing code and adapting examples. The implementation of the `dims` interface should be straightforward, as it just requires permuting the dimensions appropriately, which was already implemented in [gronerl/gdp-3-implementation](https://github.com/gronerl/gt4py/tree/gdp-3-implementation). We therefore expect the above tasks to be completed by **1 developer in half a cycle.**
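
To make the `dims` lookup and the dimension permutation more concrete, here is a minimal sketch. It assumes that the buffer exposes NumPy's `__array_interface__`, that dimension names are plain strings, and that the stencil expects `("I", "J", "K")` order. The names `LabeledBuffer`, `lookup_dims` and `to_stencil_order` are hypothetical and not part of gt4py.

```python
import numpy as np


class LabeledBuffer:
    """Hypothetical user type: a buffer plus an optional `dims` attribute."""

    def __init__(self, data, dims):
        self._data = np.asarray(data)
        self.dims = tuple(dims)

    @property
    def __array_interface__(self):
        # Expose the underlying memory through a standard NumPy protocol.
        return self._data.__array_interface__


def lookup_dims(array, annotated_dims):
    """Prefer the array's own `dims`; fall back to the stencil annotation."""
    dims = getattr(array, "dims", None)
    return tuple(annotated_dims if dims is None else dims)


def to_stencil_order(array, annotated_dims, target_dims=("I", "J", "K")):
    """Return a NumPy view with axes permuted into `target_dims` order."""
    dims = lookup_dims(array, annotated_dims)
    data = np.asarray(array)  # no custom storage type required
    if len(dims) != data.ndim:
        raise ValueError("number of dims does not match array rank")
    if sorted(dims) != sorted(target_dims):
        raise ValueError(f"cannot map dims {dims} onto {target_dims}")
    # Axis j of the result is the input axis that carries target_dims[j].
    return data.transpose([dims.index(name) for name in target_dims])


# Usage: a buffer stored as (K, I, J) is viewed in (I, J, K) order.
values = np.arange(24).reshape(4, 2, 3)
field = to_stencil_order(LabeledBuffer(values, dims=("K", "I", "J")),
                         annotated_dims=("I", "J", "K"))
```

The permutation only creates a view, so no data is copied; whether the resulting strides are acceptable to a given backend is a separate check.
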