Understanding Asynchronous/Multiprocessing in Pyth

Lets say I have a function:

from time import sleep

def doSomethingThatTakesALongTime(number):
  print number
  sleep(10)

and then I call it in a for loop

for number in range(10):
  doSomethingThatTakesALongTime(number)

How can I set this up so that it only takes 10 seconds TOTAL to print out:

$ 0123456789

Instead of taking 100 seconds. If it helps, I'm going to use the information YOU provide to do asynchronous web scraping. i.e. I have a list of sites I want to visit, but I want to visit them simultaneously, rather than wait for each one to complete.

标签： python multithreading asynchronous multiprocessing

4条回答

男人必须洒脱

2楼-- · 2019-06-05 04:46

asyncoro supports asynchronous, concurrent programming. It includes asynchronous (non-blocking) socket implementation. If your implementation does not need urllib/httplib etc. (that don't have asynchronous completions), it may fit your purpose (and easy to use, as it is very similar to programming with threads). Your above problem with asyncoro:

import asyncoro

def do_something(number, coro=None):
    print number
    yield coro.sleep(10)

for number in range(10):
    asyncoro.Coro(do_something, number)

0人赞添加讨论(0) 举报

小情绪 Triste *

3楼-- · 2019-06-05 04:47

Try to use Eventlet — the first example of documentation shows how to implement simultaneous URL fetching:

urls = ["http://www.google.com/intl/en_ALL/images/logo.gif",
     "https://wiki.secondlife.com/w/images/secondlife.jpg",
     "http://us.i1.yimg.com/us.yimg.com/i/ww/beta/y3.gif"]

import eventlet
from eventlet.green import urllib2

def fetch(url):
  return urllib2.urlopen(url).read()

pool = eventlet.GreenPool()
for body in pool.imap(fetch, urls):
  print "got body", len(body)

I can also advise to look toward Celery for more flexible solution.

0人赞添加讨论(0) 举报

Ridiculous、

4楼-- · 2019-06-05 04:54

Take a look at scrapy framework. It's intended specially for web scraping and is very good. It is asynchronus and built on twisted framework.

http://scrapy.org/

0人赞添加讨论(0) 举报

Understanding Asynchronous/Multiprocessing in Pyth

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间