Comments (4)
I cannot reproduce. What does valgrind say? And have you tried with latest Polars?
from polars.
I just downgraded to polars==0.20.15 and pyarrow==15.0.0 and still I can't reproduce although this env has python 3.11.9
--------Version info---------
Polars: 0.20.15
Index type: UInt32
Platform: Linux-5.15.146.1-microsoft-standard-WSL2-x86_64-with-glibc2.35
Python: 3.11.9 (main, Apr 6 2024, 17:59:24) [GCC 11.4.0]
----Optional dependencies----
adbc_driver_manager: <not installed>
cloudpickle: <not installed>
connectorx: 0.3.2
deltalake: <not installed>
fastexcel: <not installed>
fsspec: <not installed>
gevent: <not installed>
hvplot: <not installed>
matplotlib: 3.8.4
numpy: 1.26.4
openpyxl: <not installed>
pandas: 2.2.2
pyarrow: 15.0.0
pydantic: <not installed>
pyiceberg: <not installed>
pyxlsb: <not installed>
sqlalchemy: <not installed>
xlsx2csv: <not installed>
xlsxwriter: <not installed>
from polars.
Ok thanks for checking in on this, I may have something far more wrong with my system then.
❯ python3
Python 3.12.3 (main, Apr 9 2024, 08:09:14) [GCC 13.2.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import polars as pl
>>> pl.DataFrame({"a": [1,2,3], "b":[4,5,6]})
shape: (3, 2)
┌─────┬─────┐
│ a ┆ b │
│ --- ┆ --- │
│ i64 ┆ i64 │
╞═════╪═════╡
│ 1 ┆ 4 │
│ 2 ┆ 5 │
│ 3 ┆ 6 │
└─────┴─────┘
>>> pl.show_versions
<function show_versions at 0x7f343ed68cc0>
>>> pl.show_versions()
fish: Job 1, 'python3' terminated by signal SIGSEGV (Address boundary error)
I went ahead and upgraded my versions and same issue. Seemingly works fine from an jupyter notebook however. I fear something is really wrong with my system. Ill go ahead and close this.
from polars.
I have narrowed it down to, in my Nix configuration, when I have PyArrow enabled, the error happens as soon as I try to use Polars for certain activities. However, what is odd is I do not get any of these errors when running in a Jupyter notebook.
(python312.withPackages (ppkgs: with ppkgs; [
polars
pyarrow
numpy
pandas
scipy
matplotlib
seaborn
boto3
tqdm
pyyaml
requests
ipython
ipykernel
humanize
]))
Is there a certain configuration i need to pass to valgrind to get this to report something useful?
❯ valgrind --leak-check=yes python3 -c "import polars as pl; pl.show_versions()"
==39677== Memcheck, a memory error detector
==39677== Copyright (C) 2002-2022, and GNU GPL'd, by Julian Seward et al.
==39677== Using Valgrind-3.22.0 and LibVEX; rerun with -h for copyright info
==39677== Command: python3 -c import\ polars\ as\ pl;\ pl.show_versions()
==39677==
fish: Job 1, 'valgrind --leak-check=yes pytho…' terminated by signal SIGSEGV (Address boundary error)
from polars.
Related Issues (20)
- Expressified `expr.name.map` HOT 2
- `.square` as syntactic sugar for `.pow(2)`
- add a margin of error method for Confidence Intervals (CI)
- raise_if_empty=False does not suppress NoDataError if skip_rows_after_header is passed and no data rows remain
- read_csv failure: provide line number not just column and bytes on parse failure HOT 2
- sink_parquet_all support HOT 1
- Cannot initialize series with empty 2D numpy array
- Add a column if not present else no op
- "WHERE true/false" breaks in SQLContext on DataFrame with null columns
- New String/Schema
- Inconsistent type behavior in when().then() with str.to_decimal() and pl.lit() HOT 7
- `ewm_mean_by` documented formula for the decay parameter is incorrect
- Generator output support HOT 4
- PanicException when printing df with Config `set_tbl_width_chars(-1)`
- support +vL datatype in rust HOT 1
- assert_series_equal and assert_frame_equal are inconsistent HOT 1
- Does `arg_max` always return the first maximal index?
- scan_parquet returns ComputeError if there are no parquet files HOT 5
- min and max return all nulls in pl.Enum HOT 1
- Support integer period on a temporal column HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from polars.