
March 19, 2008

Why are my monads so slow?

If you've read the last few posts, you know that I chose to use monadic parsing for some application code I'm writing. At an abstract level, it turned out beautifully: I was able to write the parser very elegantly and compactly despite the serialization format having some unique requirements. But at a practical level, it failed pretty miserably. It was horribly slow. With a deadline looming, I had to abandon my monadic parser and go back to using an older serialization format. Not content to give up so easily, though, I decided to figure out: why are my monads so slow? I hope my investigation will be of help if you choose to use similar techniques in your own code.

First, I tried running a profiler like cProfile. Such profilers normally work quite well, but they don't cope well with lots of first-class functions. And since monadic code is full of first-class functions, the profiler didn't give me much useful information.
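
For reference, here's roughly how one might invoke the profiler; run_benchmark is just a stand-in name for the real serializer code, which isn't shown. The trouble is that nearly all the time shows up under a few generic names like bind or <lambda>, which says almost nothing about which part of the parser is actually slow:

    import cProfile
    import pstats

    def run_benchmark():
        pass  # stand-in: parse/serialize the test data here

    # Profile the run and dump the raw stats to a file.
    cProfile.run("run_benchmark()", "parser.prof")

    # Show the 20 most expensive calls by cumulative time.
    stats = pstats.Stats("parser.prof")
    stats.sort_stats("cumulative").print_stats(20)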

So, I had to do the optimizing the "old-fashioned" way: I wrote a benchmark and ran both of my serializers against it. To start, I had two data points, one for traditional imperative code and one for fully monadic code, and this is what I got:

imperative: 0.465 seconds
monadic:    3.893 seconds
Monadic code was 8x slower! But why? To narrow it down, I added more data points by implementing the serializer in gradually more monadic ways. For example, I wrote one serializer in a functional/immutable style that passes around a "stream" rather than a string, and another that passes around "state" in a very monadic way but without using the monad class from Pysec. Then I got this:

imperative:                0.465 seconds
functional with stream:    1.018 seconds
almost monadic with state: 2.450 seconds
monadic:                   3.893 seconds
This gave some answers. For one, passing around an immutable "stream" object is twice as expensive as merely passing around a string, and passing around a state object on top of that is more expensive still.
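
To make that concrete, here's a minimal sketch of the difference between the two styles (illustrative names, not the actual Pysec code). The key point is that the stream version allocates a brand-new object on every single read:

    # Imperative style: index into a string directly, no allocation.
    def read_char(text, index):
        return text[index], index + 1

    # Functional/immutable style: every read allocates a new Stream.
    class Stream(object):
        def __init__(self, text, index=0):
            self.text = text
            self.index = index
        def read_char(self):
            return self.text[self.index], Stream(self.text, self.index + 1)

With these clues, I was able to optimize in certain ways and reduce it to this: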

imperative:                0.465 seconds
functional with stream:    1.018 seconds
almost monadic with state: 1.098 seconds
monadic:                   1.283 seconds

That's a nice 3x improvement. What was I able to do? By far the biggest cost I was able to eliminate was object creation: I flattened two layers of abstraction into one, which cut the number of objects created in half. Second, I created a "read until" operation that the "grammar" of the parser could use more efficiently, a form of pushing performance-critical code down the layers of abstraction (see the sketch below). Finally, I stopped using decorators on some often-called functions.
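
Here's a rough sketch of the "read until" idea, reusing the illustrative Stream class from above (again, not the actual Pysec code). Instead of running the monadic machinery once per character, it finds the terminator with a single native string operation and allocates a single new stream:

    def read_until(stream, terminator):
        # One native find() call instead of one monadic step per character.
        end = stream.text.find(terminator, stream.index)
        if end == -1:
            raise ValueError("terminator not found")
        token = stream.text[stream.index:end]
        return token, Stream(stream.text, end + len(terminator))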

In the end, it looks like monadic code is about 2-3x slower in Python, and almost all of that is really the cost of doing things in a functional/immutable way. In particular, creating data structures appears to be the main bottleneck. In functional languages, these operations are very common and heavily optimized, so they're typically very fast. It's not the same in Python. Just as you have to avoid recursion because there's no tail-call optimization, it looks like you have to avoid a functional/immutable coding style if you care about performance, because object creation is so slow. On the other hand, if you don't mind the performance hit, it makes the code much more elegant, just as recursion usually does.

One thing of interest is that there's essentially no performance penalty for using monads over a plain functional/immutable code style. The 20% penalty seen between "almost monadic" and "monadic" exists only because I wrap the monad in a Parser class, which allows nice operator overloading.
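
For the curious, that overhead looks something like this sketch (illustrative, not the actual Pysec class): every time two parsers are combined, an extra Parser object gets allocated purely so that operators can be overloaded. Here ValueError stands in for whatever failure signal the parser uses:

    class Parser(object):
        def __init__(self, run):
            self.run = run  # run(state) -> (value, new_state)
        def __or__(self, other):
            # The choice combinator itself is just a function...
            def choice(state):
                try:
                    return self.run(state)
                except ValueError:
                    return other.run(state)
            # ...but we pay for one more wrapper object to get the | syntax.
            return Parser(choice)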

Here's a summary of what you can do to speed up any functional/immutable-style code in Python, including monadic code:

  • Make object creation as fast as possible. Don't do anything fancy in __init__ (see the sketch after this list).
  • Use as few layers of abstraction as possible, especially when an object is created in each layer.
  • Push common or expensive operations down the layers of abstraction, especially if doing so avoids creating objects.
  • Avoid using decorators for heavily used functions.
  • Don't use wrapper classes if you don't have to.
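
As an example of the first two tips, here's a sketch (hypothetical names) of a state object with a bare-bones __init__. Using __slots__ also makes each instance cheaper to create in CPython:

    class ParseState(object):
        __slots__ = ("text", "index")
        def __init__(self, text, index):
            # No validation, no derived fields: just store the arguments.
            self.text = text
            self.index = index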

As a final thought, I'd like to mention that while there's currently a substantial performance penalty for using immutable data structures, that style is going to become increasingly important as we enter the many-core era. No matter what style of concurrency you prefer, immutable data is always easier to work with when concurrency is involved; concurrency and mutable data are just a bad combination. I think it's going to be very important for language designers to address this when working on the performance of their languages. I certainly hope future versions of Python are much faster with immutable data. If they are, the performance penalty of using monads will almost disappear.

6 comments:

  1. I'd be curious to know if the JIT optimization performed by Psyco reduces the relative penalty for working with immutable objects.

    Seeing what PyPy does to this code would be even more interesting, but that project is so bleeding edge, I have no idea what other issues you would stumble on.

  2. One thing worth keeping in mind about Python is that function calls are expensive in time, more so than in most other languages. So to get the fastest Python code, write in an imperative style and rely heavily on builtins.

    Not good news for someone who wants to write fast functional code in Python. :D

  3. Very interesting. The D language, version 2.x, has true closures; I'd like to see how your code works there.

  4. Python doesn't do any optimization for them at all. Monads are incredibly flexible, but as any decent C.S. major knows, everything comes at a price. Monads give a high level of abstraction and can model a huge variety of computations. There's no reason to expect them to be on par with simpler, to-the-point styles of programming.

    Monads have their place in programming, but that place is Haskell and other pure languages.

  5. AFAIK, decorators only work at function creation time and do not penalize actual calls, except for an extra function call.
    Function calls are relatively expensive in Python, though.

  6. When I was optimizing pyparsing, I found the @profile decorator described on the Python wiki to be very helpful.

    The most primitive monads (such as pyparsing's Literal and Word) are those that benefit most from code optimization.

    Function calls are performance death in Python, but some operations are more expensive than function calls. When optimizing Literal, I found that replacing

    if instring[curLoc:curLoc+literalLength]==literalstring:

    with

    if instring.startswith(literalstring, curLoc):

    was quite a healthy win. I also apparently invented a caching system for exceptions that I haven't seen elsewhere in Python. Exceptions are used everywhere to signal when one of several parsing options doesn't match, which means that thousands of Exception instances are created and almost immediately discarded. Each monad ends up raising the same exception (with some minor, er, exceptions), so I create the Exception at monad creation time, and then just keep raising the cached Exception instead of creating a new one.
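
    A simplified sketch of the pattern (not the actual pyparsing code):

    class ParseException(Exception):
        pass

    class Literal(object):
        def __init__(self, literalstring):
            self.literalstring = literalstring
            # Created once, at monad creation time...
            self.exc = ParseException("Expected " + literalstring)
        def parse(self, instring, curLoc):
            if instring.startswith(self.literalstring, curLoc):
                return curLoc + len(self.literalstring)
            # ...and re-raised every time, with no new allocation.
            raise self.exc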

