Reading data (just 20,000 numbers) from an xlsx file takes forever:
import pandas as pd
xlsxfile = pd.ExcelFile("myfile.xlsx")
data = xlsxfile.parse('Sheet1', index_col = None, header = None)
This takes about 9 seconds.
If I save the same data in CSV format, reading it takes ~25 ms:
import pandas as pd
csvfile = "myfile.csv"
data = pd.read_csv(csvfile, index_col = None, header = None)
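For reference, here is a minimal sketch of how the two timings could be measured in a repeatable way. The helper names `best_time` and `compare_readers` are made up for illustration, and the sketch assumes `myfile.xlsx` and `myfile.csv` hold the same data; it uses `pd.read_excel`, which should be equivalent to the `ExcelFile.parse` call above.

```python
import time


def best_time(fn, repeat=3):
    """Return the fastest wall-clock time (in seconds) over `repeat` calls to fn()."""
    timings = []
    for _ in range(repeat):
        start = time.perf_counter()
        fn()
        timings.append(time.perf_counter() - start)
    return min(timings)


def compare_readers(xlsx_path, csv_path, sheet="Sheet1"):
    """Time reading the same data from an xlsx file and from a CSV file.

    Hypothetical helper: the paths and sheet name are supplied by the caller.
    """
    import pandas as pd  # imported here so best_time stays usable without pandas

    xlsx_s = best_time(
        lambda: pd.read_excel(xlsx_path, sheet_name=sheet, index_col=None, header=None)
    )
    csv_s = best_time(
        lambda: pd.read_csv(csv_path, index_col=None, header=None)
    )
    return xlsx_s, csv_s
```

Calling `compare_readers("myfile.xlsx", "myfile.csv")` would return the two best-of-three timings in seconds, which makes the comparison less sensitive to one-off effects such as disk caching.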
Is this an issue with openpyxl, or am I missing something? Are there any alternatives?