v0.15: Experimental new CSV-, and Geographic integrations and many other fixes #991
Jolanrensen announced in Announcements
This release contains several new features, tons of fixes, and two exciting new experimental integrations:
GeoDataFrame class for working with geographical data (from GeoJson/Shapefile) and plotting it with Kandy. See below for more information.
BigInteger support: Just like we support BigDecimal numbers, DataFrame now also supports BigInteger in parsing, converting, statistics, column arithmetics, etc.
Parsing and converting String columns to other types is now faster. We also introduce the new experimental ParserOptions.useFastDoubleParser setting, which uses FastDoubleParser for faster and more flexible Double parsing.
New Experimental CSV integration
DataFrame's CSV parsing has been based on Apache Commons CSV from the beginning. While this has been sufficient for most applications, it had some issues, such as running out of memory, slow performance, and an API lacking in clarity, documentation, and completeness.
For DataFrame 0.15, we introduce a new separate package org.jetbrains.kotlinx:dataframe-csv which tries to solve all these issues at once. It's based on Deephaven-CSV, which makes it faster and more memory efficient. And since we built it from the ground up, we made sure the API was complete, predictable, and documented carefully.
To try it yourself, explicitly add the dependency org.jetbrains.kotlinx:dataframe-csv to your project. In notebooks you can add enableExperimentalCsv=true to the %use-magic, like %use dataframe(enableExperimentalCsv=true).
Use the new DataFrame.readCsv() / DataFrame.readTsv() / DataFrame.readDelim() functions over the old DataFrame.readCSV() ones.
We happily await your feedback!
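For a Gradle project, adding the experimental CSV module explicitly might look like the build.gradle.kts sketch below. The artifact coordinates come from this announcement; the version 0.15.0 and the use of the core org.jetbrains.kotlinx:dataframe artifact alongside it are assumptions based on the release tag, so adjust them to your setup.

```kotlin
// build.gradle.kts (illustrative sketch, not an official template)
repositories {
    mavenCentral()
}

dependencies {
    // Core DataFrame library; the version is assumed from the v0.15.0 release tag
    implementation("org.jetbrains.kotlinx:dataframe:0.15.0")
    // New experimental CSV module; it must be added explicitly
    implementation("org.jetbrains.kotlinx:dataframe-csv:0.15.0")
}
```

With the module on the classpath, the new DataFrame.readCsv(), DataFrame.readTsv(), and DataFrame.readDelim() functions should be available next to the old DataFrame.readCSV() ones.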
New Experimental Geo integration
Kandy v0.8 introduces geo-plotting which allows you to visualize geospatial/geographical data using the awesome Kandy DSL. To make working with this geographical data (from GeoJson/Shapefile) easier, we happily accepted the GeoDataFrame PR from the Kandy team.
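As a rough sketch of the Gradle setup for this integration, a build.gradle.kts could add the dataframe-geo artifact together with the OSGeo repository it needs. The artifact name and repository URL are taken from this announcement; the version 0.15.0 and the presence of the core dataframe artifact are assumptions.

```kotlin
// build.gradle.kts (illustrative sketch, not an official template)
repositories {
    mavenCentral()
    // Additional repository required for the Geo integration
    maven("https://repo.osgeo.org/repository/release")
}

dependencies {
    // Core DataFrame library; the version is assumed from the v0.15.0 release tag
    implementation("org.jetbrains.kotlinx:dataframe:0.15.0")
    // Experimental Geo module providing GeoDataFrame
    implementation("org.jetbrains.kotlinx:dataframe-geo:0.15.0")
}
```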
To try it yourself, explicitly add the dependency org.jetbrains.kotlinx:dataframe-geo to your project or notebook (with the repository maven("https://repo.osgeo.org/repository/release")) and use GeoDataFrame.readGeoJson() or GeoDataFrame.readShapeFile() to get started!
Features
Full BigInteger support by @Jolanrensen in #972
Compiler Plugin
Fixes
Adding contracts for Anycol.isValueColumn etc. for smart-casting by @Jolanrensen in #882
Deprecating DataFrame.read("", delimiter =) by @Jolanrensen in #902
Fixed local classes being inferred as Any by changing visibility check by @Jolanrensen in #929
describe() fixes by @Jolanrensen in #937
Bumped deprecations of startsWith and endsWith in CS DSL to Error by @Jolanrensen in #978
Docs and Examples
New Contributors
Full Changelog: v0.14.2...v0.15.0
This discussion was created from the release v0.15: Experimental new CSV-, and Geographic integrations and many other fixes.