I made a pipeline to put scrapy data to my Parse Backend
PARSE = 'api.parse.com' PORT = 443
However, I can't find the right way to post the data in Parse. Because everytime it creates undefined objects in my Parse DB.
class Newscrawlbotv01Pipeline(object):
def process_item(self, item, spider):
for data in item:
if not data:
raise DropItem("Missing data!")
connection = httplib.HTTPSConnection(
settings['PARSE'],
settings['PORT']
)
connection.connect()
connection.request('POST', '/1/classes/articlulos', json.dumps({item}), {
"X-Parse-Application-Id": "XXXXXXXXXXXXXXXX",
"X-Parse-REST-API-Key": "XXXXXXXXXXXXXXXXXXX",
"Content-Type": "application/json"
})
log.msg("Question added to PARSE !", level=log.DEBUG, spider=spider)
return item
Example of an error :
TypeError: set([{'image': 'http://apps.site.lefigaro.fr/sites/apps/files/styles/large/public/thumbnails/image/sport24.png?itok=caKsKUzV',
'language': 'FR',
'publishedDate': datetime.datetime(2016, 3, 16, 21, 53, 10, 289000),
'publisher': 'Le Figaro Sport',
'theme': 'Sport',
'title': u'Pogba aurait rencontr\xe9 les dirigeants du PSG',
'url': u'sport24.lefigaro.fr/football/ligue-des-champions/fil-info/prolongation-entre-le-bayern-et-la-juve-796778'}]) is not JSON serializable