Essential Open-Source Library pandas Awarded CZI Grant to Further Development

 

We’re pleased to announce that pandas, the open-source library providing high-performance data structures for tabular data analysis, has received grant funding from the Chan Zuckerberg Initiative (CZI) as part of their Essential Open Source Software program. This funding will help the pandas project continue to thrive. Anaconda has supported pandas development over the years, most recently through contributions from employees Brock Mendel and Tom Augspurger.

We were happy to see that the CZI explicitly acknowledged the funding needs of foundational open-source projects. While pandas is fortunate to have some organizational support, including from Anaconda, securing funding for development is difficult, and maintaining a project and community as large and active as pandas is a time-consuming endeavor.

Basic project maintenance is often one of the first things overworked maintainers cut. We’re explicitly dedicating a portion of the funding to increased project maintenance. This will mean fewer open issues, higher average quality and clarity of open issues, faster responses to new issues, and faster and better reviews on pull requests. All of this adds up to a better contributing experience and a more stable pandas.

We also plan to fund development time on some of our larger items. Specifically, we’ll work on:

  • Improved Extension Array Interface: We recently introduced an interface for storing custom array-like objects inside pandas’ data structures. This is a large change to pandas and would benefit from dedicated time to improve the interface and implementation.
  • Native String Refactor: Change how we store and process strings, resulting in lower memory usage and higher performance on text datasets.
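For readers unfamiliar with these two areas, here is a minimal sketch of what extension arrays and a dedicated string dtype look like from the user's side. It only uses dtypes that already ship with released pandas (`"Int64"` and `"string"`, the latter assuming pandas 1.0 or newer); it does not show the grant-funded work itself, and the sample values are made up for illustration.

```python
import pandas as pd

# Nullable integers are stored in an extension array rather than a plain
# NumPy array, so the missing value does not force a cast to float.
ints = pd.Series([1, 2, None], dtype="Int64")

# Without the extension dtype, the same values fall back to float64 with NaN.
floats = pd.Series([1, 2, None])

# Text stored with the dedicated string extension dtype (assumes pandas >= 1.0)
# versus the generic object dtype, which can hold arbitrary Python objects.
words = ["pandas", "grant", None]  # illustrative sample data
as_string = pd.Series(words, dtype="string")
as_object = pd.Series(words, dtype="object")

print(ints.dtype, floats.dtype)
print(as_string.dtype, as_object.dtype)

# Extension dtypes can be detected through the public pandas.api.types helpers.
print(pd.api.types.is_extension_array_dtype(ints.dtype))
print(pd.api.types.is_extension_array_dtype(floats.dtype))
```

Comparing `as_string.memory_usage(deep=True)` with `as_object.memory_usage(deep=True)` on a real text dataset is one way to see how the storage choice affects memory; the numbers depend on the pandas version and storage backend.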

We’re excited to put these funds to use to ensure the continued health of the pandas project and community.

More information about the Essential Open Source Software program is available from CZI.

