Computational Biochemistry: Fun with Numba, NumPy and F2PY

Saturday, April 27, 2013

I've started using Numba to speed up MD simulations for our course in Molecular Statistics, which I teach together with Jan Jensen and Jimmy Charnley Kromann.

One exercise uses the Velocity Verlet algorithm to simulate a Lennard-Jones gas/liquid with periodic boundary conditions. Since we do everything in Python in this course, running the actual simulation is quite slow. So we supply the students with a FORTRAN module, compiled with F2PY, which they can use once they've written their own Lennard-Jones gradient code.
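For readers who haven't seen it, a single Velocity Verlet step can be sketched roughly like this. This is a minimal pure-Python illustration, not the course code: the names velocity_verlet_step, harmonic_force and energy are made up for the example, and a 1D harmonic spring with unit mass stands in for the Lennard-Jones forces.

```python
def velocity_verlet_step(x, v, a, force, dt, mass=1.0):
    """Advance positions x and velocities v by one time step dt."""
    n = len(x)
    # Positions are updated from the current velocity and acceleration.
    x_new = [x[i] + v[i] * dt + 0.5 * a[i] * dt * dt for i in range(n)]
    # Forces are re-evaluated at the new positions.
    a_new = [f / mass for f in force(x_new)]
    # Velocities use the average of the old and new accelerations.
    v_new = [v[i] + 0.5 * (a[i] + a_new[i]) * dt for i in range(n)]
    return x_new, v_new, a_new


def harmonic_force(x, k=1.0):
    """Hypothetical stand-in force: independent springs, F = -k x."""
    return [-k * xi for xi in x]


def energy(x, v, k=1.0, mass=1.0):
    """Kinetic plus potential energy of the spring system."""
    return sum(0.5 * mass * vi * vi + 0.5 * k * xi * xi
               for xi, vi in zip(x, v))


x, v = [1.0], [0.0]
a = harmonic_force(x)  # unit mass assumed, so a = F
e0 = energy(x, v)
for _ in range(1000):
    x, v, a = velocity_verlet_step(x, v, a, harmonic_force, dt=0.01)
e1 = energy(x, v)
```

With a small time step the total energy should stay close to its starting value, which is a handy sanity check when swapping in a different force function.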

The FORmula TRANslator module is extremely fast compared to Python code. I rewrote the code from last year's course as a function I could use with Numba and compare directly to the F2PY module.

For 100 particles:

Numba: 0.146 ms/iteration (466x speed up)
F2PY: 0.236 ms/iteration (289x speed up)
Python: 68.06 ms/iteration

Now, I'm pretty sure a pure FORTRAN implementation would be faster than the F2PY module, but I am very impressed with Numba, since it's standard Python code and MUCH simpler to read than FORTRAN -- in theory all you have to do is write @autojit before a function definition. This is of course in theory ... I didn't do anything to improve the readability of the Lennard-Jones code here. Before I posted, I had actually inserted a comment saying """This code block is unreadable""".

Now, there were a few issues I discovered:

1) @autojit vs. @jit(argtypes=[double[:,:], double[:]])

Numba needs to know the types of a function's arguments before it can compile the function. If you're lucky, Numba infers the right types and you can get away with @autojit, but sometimes autojitting makes the code SLOWER than standard, interpreted Python. In one case autojit made the code 10x slower, whereas explicitly stating the function argument types with @jit(argtypes = .... ) gave a speed up of 20x on the same code, both compared to interpreted Python.
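As far as I know, newer Numba releases have dropped both @autojit and the argtypes keyword. A plain @jit or @njit now infers types lazily, and an explicit signature is passed as the first argument instead. Here is a rough sketch of the explicit form. The weighted_sum kernel is a made-up example, and the try/except fallback is only there so the snippet still runs (uncompiled) when Numba isn't installed.

```python
import numpy as np

try:
    from numba import njit, float64
    # Explicit signature: returns a float64, takes a 2-D and a 1-D
    # float64 array. The function is compiled at decoration time.
    typed = njit(float64(float64[:, :], float64[:]))
except ImportError:
    # Fallback (assumption for this sketch): run as plain Python.
    typed = lambda func: func


@typed
def weighted_sum(matrix, weights):
    """Hypothetical kernel: sum over i, j of matrix[i, j] * weights[j]."""
    total = 0.0
    for i in range(matrix.shape[0]):
        for j in range(matrix.shape[1]):
            total += matrix[i, j] * weights[j]
    return total


m = np.ones((3, 2))
w = np.array([1.0, 2.0])
result = weighted_sum(m, w)
```

With an explicit signature the argument dtypes have to match exactly, so arrays of integers or float32 would need converting to float64 first.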

2) Returning tuples in compiled code block

Numba does not allow a compiled function to return a tuple, so don't do this.
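One way around this, sketched here under my own assumptions rather than taken from the course code, is to return a single scalar and write everything else into an array the caller has preallocated. The kernel below is a made-up harmonic example shown without any Numba decorator so it runs as plain NumPy code.

```python
import numpy as np


def harmonic_energy_and_gradient(positions, gradient_out):
    """Hypothetical kernel: E = 0.5 * sum(x^2), so dE/dx = x.

    The gradient is written into the preallocated gradient_out array,
    so only a scalar has to be returned.
    """
    energy = 0.0
    for i in range(positions.shape[0]):
        for k in range(positions.shape[1]):
            energy += 0.5 * positions[i, k] ** 2
            gradient_out[i, k] = positions[i, k]
    return energy


positions = np.array([[1.0, 2.0, 2.0], [0.0, 3.0, 4.0]])
gradient = np.empty_like(positions)  # filled in place by the kernel
energy = harmonic_energy_and_gradient(positions, gradient)
```

The same pattern avoids allocating a new gradient array on every iteration, since the output buffer can be reused between steps.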

3) Use NumPy properly

Numba is NumPy aware, so code ran faster when numbers were stored in numpy.array types rather than just regular Python lists.
At first I was using numpy.round() to round to the nearest integer in the periodic boundary condition code. Switching three numpy.round() calls to numpy.rint() gave a speed up of around 100x in code execution.
Use U[i, j] instead of U[i][j] on NumPy arrays. Not much of a difference in the vanilla Python code, but MASSIVE speed gains in compiled code.
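To make the last two points concrete, here is a small sketch of a minimum-image distance using numpy.rint() and U[i, j]-style indexing. It is not the course code: the function names, the cubic box and the two test particles are assumptions for illustration.

```python
import numpy as np


def minimum_image(delta, box_length):
    """Wrap a displacement into [-box_length/2, box_length/2]."""
    return delta - box_length * np.rint(delta / box_length)


def pair_distance(positions, i, j, box_length):
    """Distance between particles i and j under periodic boundaries."""
    r2 = 0.0
    for k in range(positions.shape[1]):
        # positions[i, k] is a single indexing operation, whereas
        # positions[i][k] first builds a temporary row view and then
        # indexes into that.
        d = minimum_image(positions[i, k] - positions[j, k], box_length)
        r2 += d * d
    return np.sqrt(r2)


box = 10.0
pos = np.array([[0.5, 0.5, 0.5], [9.5, 0.5, 0.5]])
dist = pair_distance(pos, 0, 1, box)
```

The two particles are 9 apart inside the box but only 1 apart through the periodic boundary, so the wrapped distance comes out as 1.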

Python Lennard-Jones code:

F2PY Lennard-Jones code:

Python Velocity Verlet solver:
