There is more to ‘pandas.read_csv()’ than meets the eye

A deep dive into some of the parameters of the read_csv function in pandas Pandas is one of the most widely used libraries in the Data Science ecosystem. This versatile library gives us tools to read, explore and manipulate data in Python. The primary tool used for data import in pandas is read_csv().This function accepts the file path of a … Continue reading There is more to ‘pandas.read_csv()’ than meets the eye

Reducing memory usage in pandas with smaller datatypes

Optimizing pandas memory usage by the effective use of datatypes Managing large datasets with pandas is a pretty common issue. As a result, a lot of libraries and tools have been developed to ease that pain. Take, for instance, the pydatatable library mentioned below. Using Python’s datatable library seamlessly on Kaggle Despite this, there are … Continue reading Reducing memory usage in pandas with smaller datatypes