Ikke's blog

'cause this is what I do

Python ‘all’ odity

[update] Question solved, see bottom of post.

Since Python 2.5 the language got a new built-in method ‘all’ (and it’s nephew ‘any’). I wanted to play around with this a little, combined with generators, so I created a little testcase to test performance.

Here’s the test-case: take a list L of X random numbers in a given range [A, B], and check whether

all elements in L are >= A
all elements in L are >= (A + Z) where Z is a number in [0, (B - A)]

The first test should always result True, the second test could result to False.

Here’s the output of a test-run:

In [1]: import random, sys

In [2]: a = [random.randint(100, sys.maxint) for i in xrange(2000000)]

In [3]: len(a)
Out[3]: 2000000

In [4]: #Check whether all elements are >= 100 

In [5]: %timeit all(i >= 100 for i in a)
10 loops, best of 3: 515 ms per loop

In [6]: %timeit any(i < 100 for i in a)
10 loops, best of 3: 454 ms per loop

In [7]: def f(l):
   ...:     for i in l:
   ...:         if i < 100:
   ...:             return False
   ...:     return True
   ...: 

In [8]: %timeit f(a)
10 loops, best of 3: 292 ms per loop

In [9]: #Same thing for 100000, since now the list shouldn't be completely iterated

In [10]: %timeit all(i >= 100000 for i in a)
100 loops, best of 3: 4.73 ms per loop

In [11]: %timeit any(i < 100000 for i in a)
100 loops, best of 3: 4.29 ms per loop

In [12]: def g(l):
   ....:     for i in l:
   ....:         if i < 100000:
   ....:             return False
   ....:     return True
   ....: 

In [13]: %timeit g(a)
100 loops, best of 3: 2.82 ms per loop

In [14]: #For reference

In [15]: %timeit False in (i >= 100 for i in a)
10 loops, best of 3: 531 ms per loop

In [16]: %timeit False in (i >= 100000 for i in a)
100 loops, best of 3: 5.03 ms per loop

It’s as if ‘all’, ‘any’ or ‘in’ don’t break/return when a first occurence of False (or True, obviously) is found. Is this the desired behaviour, and if it is, why? The calculation time difference between using all/any/in or a custom-made function (which is, unlike all etc, not written in C) which breaks whenever it can, is pretty astonishing.

[update] Question solved. It’s pretty normal the function-based approach performs better, since it combines what ‘all’ and the generator provided to ‘all’ do, taking away the generator function-call overhead. Damn

Posted in Development.

Tagged with performance, python.

3 comments

By Nicolas – May 1, 2008

Python if/else in lambda

Scott, in your “Functional Python” introduction you write:

The one limitation that most disappoints me is that Python lacks is a functional way of writing if/else. Sometimes you just want to do something like this:
lambda x : if_else(x>100, “big number”, “little number”)
(This would return the string “big number” if x was greater than 100, and “little number” otherwise.) Sometimes I get around this by defining my own if_else that I can use in lambda-functions:
def if_else(condition, a, b) :
   if condition : return a
   else         : return b

Actually, you don’t need this helper if_else function at all:

In [1]: f = lambda x: x > 100 and 'big' or 'small'

In [2]: for i in (1, 10, 99, 100, 101, 110):
...:     print i, 'is', f(i)
...:
1 is small
10 is small
99 is small
100 is small
101 is big
110 is big

James, obviously you’re right… Stupid me didn’t think about that. Your version won’t work when a discriminator isn’t known at import time. But even then a function taking *args and **kwargs with a class-like name, returning a correct class instance, would cut the job.

Regarding the module/plugin stuff, I’d rather use setuptools/pkg_resources

Posted in Development.

Tagged with functional programming, python.

17 comments

By Nicolas – February 16, 2008

Code Review comic

OSNews: WTFs per minute

I just love this comic.

Posted in Development.

Tagged with comic.

4 comments

By Nicolas – February 11, 2008

Python factory-like type instances

When designing applications or libraries, sometimes you need to be able to create instances of a certain interface (in a liberal sense) at runtime without knowing at write/compile time which specific implementation (class) you’ll need to use, as this could depend on runtime variables.

An example of this is an interface providing some functionality which should be implemented differently on different platforms, eg Linux and Windows.

There are some standard patterns how to achieve this. One of them is the factory pattern, which works somewhat like this Python example (let’s pretend ‘PLATFORM’ is ‘linux2′ or ‘win32′, ie sys.platform):

#Pretend we use sys.platform instead of PLATFORM where we use it
PLATFORM = 'linux2'

class FooBase(object):
    def say_foo(self):
        print 'foo'

class PlatformFoo(FooBase):
    def say_platform_foo(self):
        raise NotImplementedError

    @staticmethod
    def get_class():
        #Several ways to get this (dict, introspection, if-tree,...), pick yours
        klass = {
            'linux2': LinuxFoo,
            'win32': WindowsFoo,
        }.get(PLATFORM, None)
        if not klass:
            raise Exception, 'Platform not supported'
        return klass

class WindowsFoo(PlatformFoo):
    def say_platform_foo(self):
        print 'win32 foo'

class LinuxFoo(PlatformFoo):
    def say_platform_foo(self):
        print 'linux foo'

def main():
    foo_class = PlatformFoo.get_class()
    foo = foo_class()
    foo.say_platform_foo()

if __name__ == '__main__':
    main()

Executing this code will, as expected, write ‘linux foo’ to the console. Obviously we could not return the platform-specific class in a PlatformFoo function, but an actual instance, up to you.

Python allows you to handle this situation somewhat nicer though, without introducing any intermediate functions, by using metaclasses.

Continued…

Posted in Development, Technology.

Tagged with Development, python.

2 comments

By Nicolas – February 11, 2008

Funky C

It’s pretty funny to know this is valid C99 code, implementing a very basic array assignment and hello world:

%:include <stdio.h>
??=include <stdlib.h>

int main(int argc, char *argv<::>) ??<
    int i??(:> = {1, 2, 3??>;
    printf("Hello world\n");
    return 0;
%>

It’s (ab)using an obscure feature in the C89 and C99 specifications called Trigraph and Digraph, when using GCC you need to pass the ‘-trigraphs’ parameter to enable this functionality. More information can be found on the Wikipedia page about it. I wonder whether people joining code obfuscation games use this.

Posted in Development.

Tagged with C, coding.

7 comments

By Nicolas – February 9, 2008

How not to write Python code

Lately I’ve been reading some rather unclean Python code. Maybe this is mainly because the author(s) of the code had no in-depth knowledge of the Python language itself, the ‘platform’ delivered with cPython,… Here’s a list of some of the mistakes you should really try to avoid when writing Python code:

Remember Python comes batteries included
Python is shipped with a whole bunch of standard modules implementing a broad range of functionality, including text handling, various data types, networking stuff (both low- and high-level), document processing, file archive handling, logging, etc. All these are documented in the Python Library Documentation, so it is a must to browse at least through the list of available modules, so you get some notions of what you can use by default. An example: don’t introduce a dependency on Twisted to implement a very basic and simple custom HTTP server if you don’t have any performance needs, use BaseHTTPServer and derivates.
Python is Python, don’t try to emulate bad coding patterns from other languages
Python is a mature programming language which provides great flexibility, but also has some pretty specific patterns which you might not know in other languages you used before.
As an example, don’t try to emulate PHP’s ‘include’ or ‘require’ function, at all. This could be done, somewhat, by writing the code to be included (and executed on inclusion) in a module on the top level (ie. not in functions/classes/…), and using something like ‘from foo import *’ where you want this code to be executed. This will work, but it can become hard to maintain this. Modules are not meant to be used like this, so don’t. If you need to execute some code at some point, put it in a module as a function, import the function and call it wherever you want.

Continued…

Posted in Development, Technology.

Tagged with Development, python.

41 comments

By Nicolas – February 8, 2008

VirtualBox launch script

I’ve been playing around with VirtualBox some more today (more on it, and other virtualization related stuff might follow later). I got a Windows XP instance running fine (and fast!) in it. Still got some issues (shared folders seem not to work when logged in as a non-administrator user in XP, although I tend to blame Windows for this issue, not VirtualBox), played around with a little Linux installation acting as an iSCSI target for its virtual block devices, accessing them from Windows using the Microsoft iSCSI initiator, etc. Here’s a screenshot of all this.

I added a launcher for my Windows VM to my panel, which was simply running ‘VBoxManage startvm <VM UUID>’. This was not an optimal solution though, as I wanted it to show a little error dialog when something went wrong when attempting to launch the VM (eg because I forgot it was already running), when the virtual disk files aren’t accessible (because I forgot to mount the volume they reside on), etc.

So I cooked a little script which runs some sanity checks before launching the virtual machine, and reports any errors. Here it is:

Continued…

Posted in Linux, Technology.

Tagged with iscsi, virtualbox.

3 comments

By Nicolas – January 31, 2008

Massive 70000 workstation Ubuntu deployment in France

After switching to OpenOffice in 2005 and introducing Mozilla Firefox and Thunderbird on their machines in 2006, the French paramilitary police (‘Gendarmery’) will make the switch to 100% Free Software based desktops in the coming years. The migration should be completed in 2014.

All workstations will be converted to Ubuntu desktops, starting this year with 5000-8000 seats, growing to 12000-15000 over the next four years. By 2014, all 70000 (!!!) desktops should be running free software.

There are three major reasons for the full switch:

Remove dependency on one single supplier
Gain full control over the whole operating system stack
Reduce costs

Nowadays licensing costs sum up to 7000000€ (that’s seven million euros) every year.

I guess this must be one of the largest Linux desktops deployments ever?

Source: AFP

Posted in Desktop, Linux.

Tagged with Desktop, Linux.

4 comments

By Nicolas – January 30, 2008

LinuxJournal 2008 Readers’ Choice Survey criticism

Looks like the 2008 edition of LinuxJournal’s ‘Readers’ Choice Survey’ has been published some days ago. The available answers in several questions are rather badly chosen though… I’d think the LinuxJournal editors wouldn’t be this clueless. Some examples:

Question 3 asks about your favorite Desktop Environment, and lists GNOME, KDE, XFCE, Enlightenment and Fluxbox. Whilst the first 3 options can be regarded as DE’s, Enlightenment and Fluxbox are pure Window Managers, not really usable as an all-inclusive desktop.

Question 5, regarding your favorite email client: if you list any web-based client (Gmail, maybe someone should tell me how this is related to Linux?), you should at least list the major free software web-based clients too (thinking of Horde IMP here).

Continued…

Posted in Various.

Tagged with 2008 Readers' Choice, Linux Journal, survey.

No comments

By Nicolas – January 26, 2008

A quote by Phillip Vandervoort

“We regret Belgium doesn’t strictly adhere to this European standard, but chose a local implementation. As you know, respect for standards is important, especially in the domains of identification and authentication.”

- Phillip Vandervoort, General Manager Microsoft Belgium and Luxembourg

Emphasis is mine. Read in the 16/01 issue of the Belgian “IT Professional” magazine, page 29, included in the ProFOSS goody bag. The article is a response on an earlier published interview with Geert Mareels, manager of the Coordination Unit of Flemish e-Government, claiming one of the reasons e-ID isn’t widely used yet is due to Microsoft not providing any e-ID related solutions.

Original quote: “We betreuren het dan ook dat België deze Europese standaard niet volledig onderschrijft, maar gekozen heeft voor een lokale implementatie. Zoals u weet is het respect voor standaarden belangrijk, in het bijzonder in de domeinen van identificatie en authenticatie.”.

Posted in Various.

Tagged with microsoft, phillip vandervoort, standards.

3 comments

By Nicolas – January 24, 2008

« Previous Next »

@donsbot Any reason Data.ByteString.zipWith' isn't exported/public API? 12:30:27 AM December 17, 2011 from web in reply to donsbot Reply Retweet Favorite
RT @raichoo: "#Ocaml has an OOP extension… that nobody uses […] not even its inventor" -Yaron Minsky, Janestreet. 09:28:04 PM December 16, 2011 from web Reply Retweet Favorite
Whenever you launch a tech startup, don't look for office space: head to Starbucks. Coffee, power & free wifi, all you need is a laptop! 06:30:52 PM December 16, 2011 from web Reply Retweet Favorite
@viktorklang Bit me a couple of time in the Haskell bindings as well. Simple approach to make a library "threadsafe"? 03:19:38 PM December 16, 2011 from web in reply to viktorklang Reply Retweet Favorite
W00t ^_^ RT @johtib Screenshot teaser for something I've been working on lately: http://t.co/OyGog474 01:19:31 PM December 16, 2011 from web Reply Retweet Favorite

@eikke

Proudly powered by WordPress and Carrington.

Python ‘all’ odity

Python if/else in lambda

Code Review comic

Python factory-like type instances

Funky C

How not to write Python code

VirtualBox launch script

Massive 70000 workstation Ubuntu deployment in France

LinuxJournal 2008 Readers’ Choice Survey criticism

A quote by Phillip Vandervoort

About Ikke's blog

Meta

Friends

Me

Planets

License

Me @ Twitter

Subscribe

About Ikke's blog

Meta

Friends

Me

Planets

License

Me @ Twitter

Tags