Comments (3)
I found a solution to the issue of garbled text in issue #190. Adding the code:
System.setProperty("sedona.global.charset","utf8");
solved the problem,
but the configuration mentioned on the official website using:
spark.driver.extraJavaOptions -Dsedona.global.charset=utf8
spark.executor.extraJavaOptions -Dsedona.global.charset=utf8
did not work.
from sedona.
spark.driver.extraJavaOptions
works when submitting the Spark application using spark-submit
or using PySpark (#1345), it will alter the Java system properties of the newly spawned Spark driver process. If you are running the main function in local mode all by yourself, System.setProperty
is the proper way of setting Java system properties.
Although System.setProperty
works in this local setup, it does not work when submitting the Spark application to a cluster. The DBF files are parsed by executors, and calling System.setProperty
on the driver won't alter the Java system properties of executors.
The columns of the DataFrame converted from SpatialRDD object are all strings, this is the shortcoming of how we parse DBF files and hold user data in SpatialRDD. The attributes in DBF files are all converted to strings and we don't keep track of their original data types in SpatialRDD. A more proper way to support Shapefiles is by implementing a Shapefile reader based on Spark DataSourceV2, which directly loads Shapefiles as DataFrames.
from sedona.
spark.driver.extraJavaOptions
works when submitting the Spark application usingspark-submit
or using PySpark (#1345), it will alter the Java system properties of the newly spawned Spark driver process. If you are running the main function in local mode all by yourself,System.setProperty
is the proper way of setting Java system properties.Although
System.setProperty
works in this local setup, it does not work when submitting the Spark application to a cluster. The DBF files are parsed by executors, and callingSystem.setProperty
on the driver won't alter the Java system properties of executors.The columns of the DataFrame converted from SpatialRDD object are all strings, this is the shortcoming of how we parse DBF files and hold user data in SpatialRDD. The attributes in DBF files are all converted to strings and we don't keep track of their original data types in SpatialRDD. A more proper way to support Shapefiles is by implementing a Shapefile reader based on Spark DataSourceV2, which directly loads Shapefiles as DataFrames.
Thanks!
from sedona.
Related Issues (20)
- after latest update of sedona snow on snowflake side all functions are gone HOT 2
- datatype geometry is not supported when 'create table xxx (geom geometry)' HOT 3
- AttributeError: 'sedona' has no attribute 'read' HOT 2
- St_isempty(geometry) finds non null geometries but does not find null geometries. HOT 2
- ST_Snap example code does not work HOT 2
- Flink Sedona,geomTbl.execute().print() happen error: HOT 2
- Sedona fails to write Delta Lake on Databricks 15.3 Beta: ClassCastException HOT 7
- ST_IsPolygonCW, ST_IsPolygonCCW, ST_ForcePolygonCW and ST_ForcePolygonCCW fails on Polygons without interior ring
- Breaking change between 1.5.3 and 1.6.0 affecting RASTER functions java.lang.NoSuchMethodError: void org.geotools.coverage.grid.GridGeometry2D HOT 5
- sedona's docker image can not run successfully on k8s HOT 2
- Cannot run sedona examples into spark-shell HOT 3
- Using setMaster(“ spark://master:7077) An error occurred during operation, but there is no problem running locally HOT 3
- Sedona website homepage layout bug in mobile view
- Sedona website favicon checker issues and missing files
- An error occurred when submitting the file. The actual directory does exist.
- Initialization of sedona 1.6.0 is quite slower than sedona 1.5.1 HOT 2
- 1.6.0 documentation needs update? HOT 6
- st_union - understanding difference with PostGIS HOT 6
- Issue with st_dump? HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sedona.