Ozone
pyarrow
which use the Java client under the hood.pyarrow
uses _hadoop_classpath_glob
to get all Hadoop jar and required libraries.
If we want to make pyarrow
support o3fs
, we should let hadoop-ozone-filesystem-0.5.0-SNAPSHOT.jar
and hadoop-ozone-filesystem-lib-current-0.5.0-SNAPSHOT.jar
be get by _hadoop_classpath_glob
.
We can use export HADOOP_CLASSPATH=$HADOOP_CLASSPATH:$OZONE_HOME/share/ozone/lib/hadoop-ozone-filesystem-lib-current-$OZONE_VERSION.jar
pyarrow
version: 0.15.1Hadoop
version: 3.2.1Ozone
version: master branchcore-site.xml
of Ozone
python3.x python_code.py