Louis
2014-06-27 04:06:40 UTC
I am doing some analysis on a relatively large data set. To be more
specific, I am running an augmented Dickey-Fuller test on a pandas time
series object. The length of the time series is 5299788. I have 8GB of RAM
on board, with about 5.5GB available when idle.
My problem is that the program eats up all my RAM, so I can't do anything
else while it is running. I can't even kill the process, as the system
stops responding to anything (e.g. Ctrl+Alt+Del). By my preliminary
estimate, the calculation needs at least 2 days to complete. My question:
is there any way to prevent the program from using all the resources, so
that I can still do other things while it runs in the background?
My data is only about 200MB.
The code is simple:
import pandas as pd
import statsmodels.tsa.stattools as ts
zn = pd.read_csv(path)
result = ts.adfuller(zn)
I would much appreciate your help.