nrw.social ist einer von vielen unabhängigen Mastodon-Servern, mit dem du dich im Fediverse beteiligen kannst.
Wir sind eine freundliche Mastodon Instanz aus Nordrhein-Westfalen. Ob NRW'ler oder NRW-Sympathifanten, jeder ist hier willkommen.

Serverstatistik:

2,8 Tsd.
aktive Profile

#ApacheArrow

0 Beiträge0 Beteiligte0 Beiträge heute
Fabian<p><span class="h-card" translate="no"><a href="https://mstdn.science/@ChristosArgyrop" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>ChristosArgyrop</span></a></span> While this has truth, there is still <a href="https://mastodon.social/tags/DuckDB" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DuckDB</span></a> and <a href="https://mastodon.social/tags/ApacheArrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ApacheArrow</span></a> that can help tons with large datasets before we need actual SQL! Don't know if that's counting as giggles though 😅</p>
Data Quine<p>My browser WASM’t prepared for this. Using DuckDB, Apache Arrow and Web Workers in real life | by Motif Analytics | Feb, 2025 | Medium </p><p><a href="https://motifanalytics.medium.com/my-browser-wasmt-prepared-for-this-using-duckdb-apache-arrow-and-web-workers-in-real-life-e3dd4695623d" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">motifanalytics.medium.com/my-b</span><span class="invisible">rowser-wasmt-prepared-for-this-using-duckdb-apache-arrow-and-web-workers-in-real-life-e3dd4695623d</span></a></p><p><a href="https://datasci.social/tags/DuckDB" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DuckDB</span></a> <a href="https://datasci.social/tags/ApacheArrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ApacheArrow</span></a> <a href="https://datasci.social/tags/DataAnalysis" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataAnalysis</span></a></p>
André Ourednik<p><a href="https://mastodon.social/tags/apachearrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>apachearrow</span></a> and <a href="https://mastodon.social/tags/gdal" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>gdal</span></a> both rely on aws-c-cmmon and related packages</p><p>Having no trust in anything <a href="https://mastodon.social/tags/Amazon" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Amazon</span></a> related (though there are certainly good people at AWS Labs, too), a question:</p><p>Isn't there a way to make arrow and gdal depend on some other packages?</p><p><span class="h-card" translate="no"><a href="https://fosstodon.org/@jorisvandenbossche" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>jorisvandenbossche</span></a></span> <span class="h-card" translate="no"><a href="https://mastodon.social/@gdal" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>gdal</span></a></span></p>
amoeba<p>🏹 We’re excited to announce the release of {arrow} 17.0.0.1. Binary packages are now available for all platforms from both CRAN and R-Universe (<a href="https://apache.r-universe.dev/arrow" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">apache.r-universe.dev/arrow</span><span class="invisible"></span></a>). This release includes some nice quality of life improvements for folks writing dplyr pipelines with arrow. Have a look below to see what’s changed or see the full changelog <a href="https://arrow.apache.org/docs/r/news/index.html" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">arrow.apache.org/docs/r/news/i</span><span class="invisible">ndex.html</span></a> for all the info. <a href="https://toot.cafe/tags/rstats" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>rstats</span></a> <a href="https://toot.cafe/tags/ApacheArrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ApacheArrow</span></a></p>
Nic Crane<p>We're delighted to announce that "Scaling Up with R and Arrow", by Nic Crane, Jonathan Keane and Neal Richardson is now available online at <a href="http://www.arrowrbook.com" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">http://www.</span><span class="">arrowrbook.com</span><span class="invisible"></span></a>. In the book, we cover a lot of the practical details and theory behind working with Arrow in R. The paper version will be available soon! <a href="https://mastodon.social/tags/rstats" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>rstats</span></a> <a href="https://mastodon.social/tags/apachearrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>apachearrow</span></a></p>
Alejandro Baez<p>Messing about with <a href="https://fosstodon.org/tags/ApacheArrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ApacheArrow</span></a> recently. And ended up discovering <a href="https://fosstodon.org/tags/glaredb" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>glaredb</span></a>. A <a href="https://fosstodon.org/tags/rust" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>rust</span></a> embedded db, in the style of <a href="https://fosstodon.org/tags/duckdb" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>duckdb</span></a>. 😎 </p><p>Though I can't say much on it yet, as I haven't yet fully grasped you wouldn't use duckdb instead. 😅</p><p>Still. Very much liking competition in this new age <a href="https://fosstodon.org/tags/OLAP" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OLAP</span></a> space. 😄</p><p><a href="https://glaredb.com/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">glaredb.com/</span><span class="invisible"></span></a></p>
rickspencer3<p><span class="h-card" translate="no"><a href="https://gladtech.social/@cuchaz" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>cuchaz</span></a></span> have you taken a look at <a href="https://social.lol/tags/apachearrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>apachearrow</span></a> and parquet?</p>
Stas Kolenikov<p>I am working in a project that has bits of <a href="https://mastodon.online/tags/rstats" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>rstats</span></a>, bits of <a href="https://mastodon.online/tags/python" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>python</span></a> and bits of <a href="https://mastodon.online/tags/cpp" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>cpp</span></a> and they are all supposed to be united by <a href="https://mastodon.online/tags/ApacheArrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ApacheArrow</span></a>. How do I access externally created Arrow objects in R? Everything that is in <a href="https://r4ds.hadley.nz/arrow.html" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">r4ds.hadley.nz/arrow.html</span><span class="invisible"></span></a>, <a href="https://arrow-user2022.netlify.app/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">arrow-user2022.netlify.app/</span><span class="invisible"></span></a> (with huge thanks to <span class="h-card" translate="no"><a href="https://hachyderm.io/@djnavarro" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>djnavarro</span></a></span>), <a href="https://arrow.apache.org/cookbook/r" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">arrow.apache.org/cookbook/r</span><span class="invisible"></span></a> (thanks to <span class="h-card" translate="no"><a href="https://fosstodon.org/@nic_crane" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>nic_crane</span></a></span>) talks about creating large Arrow-ish objects in R by reading external data in csv or parquet rather than connecting to the existing Arrow sources.</p>
Jacob Scott<p>Hey <a href="https://fosstodon.org/tags/RStats" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>RStats</span></a>, keen to get takes on this.</p><p>I really like data-management solutions like <a href="https://fosstodon.org/tags/duckdb" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>duckdb</span></a>, <a href="https://fosstodon.org/tags/ApacheArrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ApacheArrow</span></a>, <a href="https://fosstodon.org/tags/sqlite" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>sqlite</span></a> etc which help you manage large datasets whilst keeping your analysis local.</p><p>My question is, how do you approach version control with these tools? How do you make your work reproducible? Not talking about your code (just use Git), but the data it operates on. I can create a duckdb database, but it's not obvious how I should share this with others.</p><p><a href="https://fosstodon.org/tags/DataScience" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataScience</span></a> <a href="https://fosstodon.org/tags/Database" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Database</span></a> <a href="https://fosstodon.org/tags/sql" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>sql</span></a></p>
FOSSlife<p>Inaugural recipients of newly-established FOSS Contributor Fund announced by Bloomberg <a href="https://www.fosslife.org/bloomberg-announces-first-recipients-new-foss-fund" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">fosslife.org/bloomberg-announc</span><span class="invisible">es-first-recipients-new-foss-fund</span></a> <a href="https://fosstodon.org/tags/FOSS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>FOSS</span></a> <a href="https://fosstodon.org/tags/funding" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>funding</span></a> <a href="https://fosstodon.org/tags/Bloomberg" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Bloomberg</span></a> <a href="https://fosstodon.org/tags/software" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>software</span></a> <a href="https://fosstodon.org/tags/OpenSource" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OpenSource</span></a> <a href="https://fosstodon.org/tags/ApacheArrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ApacheArrow</span></a> <a href="https://fosstodon.org/tags/Curl" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Curl</span></a> <a href="https://fosstodon.org/tags/Celery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Celery</span></a></p>
Sharon Machlis<p><span class="h-card"><a href="https://social.coop/@eamon" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>eamon</span></a></span> Although you can use :rstats: for data that won't fit in memory too 😀 <br><span class="h-card"><a href="https://fosstodon.org/@thomas_mock" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>thomas_mock</span></a></span> 's lightning talk at last year's Arrow conference<br>Video <a href="https://www.youtube.com/watch?v=LvTX1ZAZy6M" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">youtube.com/watch?v=LvTX1ZAZy6</span><span class="invisible">M</span></a><br>Slides <a href="https://jthomasmock.github.io/arrow-dplyr/#/" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://</span><span class="ellipsis">jthomasmock.github.io/arrow-dp</span><span class="invisible">lyr/#/</span></a><br><a href="https://fosstodon.org/tags/rstats" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>rstats</span></a> <a href="https://fosstodon.org/tags/DuckDB" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DuckDB</span></a> <a href="https://fosstodon.org/tags/ApacheArrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ApacheArrow</span></a></p>
Daniel Hocking<p>I was just recommending <span class="h-card"><a href="https://fosstodon.org/@djnavarro" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>djnavarro</span></a></span> posts about <a href="https://bayes.club/tags/ApacheArrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ApacheArrow</span></a> <a href="https://bayes.club/tags/Parquet" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Parquet</span></a> and <a href="https://bayes.club/tags/DataScience" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataScience</span></a> in <a href="https://bayes.club/tags/RStats" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>RStats</span></a> to someone and decided I'd share here too. She writes fantastic intros and explanations in entertaining posts, tutorials, and courses.</p><p><a href="https://blog.djnavarro.net/posts/2021-11-19_starting-apache-arrow-in-r/" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.djnavarro.net/posts/2021-</span><span class="invisible">11-19_starting-apache-arrow-in-r/</span></a></p><p><a href="https://blog.djnavarro.net/posts/2022-11-30_unpacking-arrow-datasets/" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.djnavarro.net/posts/2022-</span><span class="invisible">11-30_unpacking-arrow-datasets/</span></a></p><p><a href="https://arrow-user2022.netlify.app/" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://</span><span class="">arrow-user2022.netlify.app/</span><span class="invisible"></span></a></p>
Gunnar Morling<p>🗣️ "The central idea behind Flight is deceptively simple: it provides a standard protocol for transferring Arrow data over a network"</p><p>Great post by <span class="h-card"><a href="https://fosstodon.org/@djnavarro" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>djnavarro</span></a></span>; <a href="https://mastodon.online/tags/ApacheArrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ApacheArrow</span></a> Flight (and Flight SQL) is super-interesting, definitely keep an eye on it in '23.</p><p><a href="https://blog.djnavarro.net/posts/2022-10-18_arrow-flight/" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.djnavarro.net/posts/2022-</span><span class="invisible">10-18_arrow-flight/</span></a></p>
Danielle Navarro<p>For reasons unknown she is blogging again. I am so sorry, but should you happen to be curious about how Dataset objects work in the <a href="https://fosstodon.org/tags/ApacheArrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ApacheArrow</span></a> <a href="https://fosstodon.org/tags/RStats" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>RStats</span></a> package, and enjoy me being mildly irritable about... things, this post may be of some interest? :blobcatheart: </p><p><a href="https://blog.djnavarro.net/unpacking-arrow-datasets" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.djnavarro.net/unpacking-a</span><span class="invisible">rrow-datasets</span></a></p>
Kae Suarez<p>Hello! I'm Kae, and this is my Hachyderm. </p><p>I want to be clear about what's going to happen here. </p><p>I am going to be making a lot of posts that are me floundering with tech. That's the point. As a person in DevRel, I think it's important to be honest about floundering now and then, and let others, especially engineers, see some pain points. </p><p>Plus, it can be entertaining, and gives me tons of draft material for my blog. </p><p>I hope I can show everyone some cool things!</p><p><a href="https://hachyderm.io/tags/python" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>python</span></a> <a href="https://hachyderm.io/tags/ApacheArrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ApacheArrow</span></a></p>
Danielle Navarro<p>My favourite trick for working with huge data sets in R. If your dataset is larger than memory and the query result is also larger than memory, you can still use dplyr/arrow pipelines. Example:</p><p>library(arrow)<br>library(dplyr)</p><p>nyc_taxi &lt;- open_dataset("nyc-taxi/")<br>nyc_taxi |&gt;<br> filter(payment_type == "Credit card") |&gt;<br> group_by(year, month) |&gt;<br> write_dataset("nyc-taxi-credit")</p><p>Input is 1.7 billion rows (70GB), output is 500 million (15GB). Takes 3-4 mins on my laptop 🙂 </p><p><a href="https://fosstodon.org/tags/rstats" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>rstats</span></a> <a href="https://fosstodon.org/tags/ApacheArrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ApacheArrow</span></a></p>
Danielle Navarro<p>I am diving into the internals trying to understand how the arrow <a href="https://fosstodon.org/tags/rstats" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>rstats</span></a> package wraps the <a href="https://fosstodon.org/tags/ApacheArrow" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ApacheArrow</span></a> Datasets API and the bar is playing "Rebel Girl" by Bikini Kill. Honestly living my best life</p>