I'd like to build an efficient Python iterator/generator that yields:
- All composite numbers less than N
- Along with their prime factorization
I'll call it "composites_with_factors()"
Assume we already have a list of primes less than N, or a primes generator that can do the same.
Note that I:
- DO NOT need the numbers to be yielded in numerical order
- DO NOT care if 1 is yielded at the beginning or not
- DO NOT care if primes are yielded, too
I figure this can be done with a clever recursive generator...
So, for example, a call to composites_with_factors(16) may yield:
# yields values in form of "composite_value, (factor_tuple)"
2, (2)
4, (2, 2)
8, (2, 2, 2)
6, (2, 3)
12, (2, 2, 3)
10, (2, 5)
14, (2, 7)
3, (3)
9, (3, 3)
15, (3, 5)
5, (5)
7, (7)
11, (11)
13, (13)
As you can see from the order of my output, I conceive of this working by starting with the smallest prime on the available primes generator, and outputting all powers of that prime less than N, then try again through the powers of that prime but at each stage seeing if I can apply powers of additional primes (and still be less than N). When all combinations with THAT prime are done, drop it, and repeat with the next lowest prime number available on the primes generator.
My attempts to do this with "recursive generators" have gotten me very confused on when to pop out of the recursion with "yield ", or "raise StopIteration", or "return", or simply fall out of the recursed function.
Thanks for your wisdom!
ADDITIONAL NOTE:
I do have one way to do this now: I have written a function to factor numbers, so I can factor them down to primes, and yield the results. No problem. I keep this blazingly fast by relying on a cache of "what is the lowest prime factor of number N"... for N up to 10 million.
However, once I'm out of the cache, we'll, it devolves to "naive" factoring. (Yuck.)
The point of this post is:
- I'm assuming that "generating large composites from their factors" will be faster than "factoring large composites"... especially since I DON'T care about order, and
- How can you have a Python generator "recursively" call itself, and yield a single stream of generated things?
Assuming
primesiter(n)
creates an iterator over all primes up ton
(1 should NOT be included inprimesiter
, or following code well enter inf. loop)Output
NOTE: it includes n (= 16) as well, and I used list instead of tuples. Both can easily be resolved if needed, but I will leave that as an exercise.
Here is a sieve-based implementation (please excuse the un-pythonic code :) ):
Tests:
On my machine, this took 56 seconds to run:
Examples:
Memory consumption: about 50 million integers, in 14 million lists:
Recursively (pseudo-code):