Loading...

XML

Word

Printable

Type: Story
Resolution: Done
Priority: Normal
Fix Version/s: None
Affects Version/s: None
Component/s: odh-manifest
Labels:
- feature

Sprint:
Open Data Hub Sprint 10, Open Data Hub Sprint 11, Open Data Hub Sprint 12, Open Data Hub Sprint 13, Open Data Hub Sprint 14, Open Data Hub Sprint 15
Git Pull Request:
https://github.com/opendatahub-io/odh-manifests/pull/451
Story Points:
3

Xskipper is An Extensible Data Skipping Framework, it provides a library for creating, managing and deploying data skipping indexes with Apache Spark to boosts performance and reduce cost by skipping over irrelevant data. It supports multiple data formats: Parquet, CSV, JSON, ORC and Avro.
Hive tables are supported.
Out of the box indexes supported include MinMax, ValueList and BloomFilter indexes, as well as data skipping for User Defined Functions.

Adding Xskipper (https://xskipper.io) library to spark based Jupiter notebooks, by including the maven dependency in pyspark packages provides ODH users with native data skipping support in spark notebooks.

Assignee:: Juana Nakfour (Inactive)

Reporter:: Oshrit Feder (Inactive)

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Created:: 2021/08/18 10:19 AM

Updated:: 2022/11/12 7:37 PM

Resolved:: 2022/02/03 2:57 PM

Details

Description

Attachments

Easy Agile Planning Poker

Activity

People

Dates