Google Drive API on upload--where are these extra

2019-07-20 06:00发布

To summarize the program: I'm downloading a file from my Google Drive, I'm then opening and reading a file("file_a.txt") in my local machine, then I'm opening another file in my machine("file_b.txt") in append mode, and I'm appending "file_a.txt" into "file_b.txt" before updating my Google Drive with this new "file_b".

For some reason I'm getting these blank lines. And they expand in a very peculiar way. Here's the final output of my program with the ghost lines:

==
***
name
































class
====
***
some new song
















class
====
***
final








class
====
***
final




class
====
***
final


class
====
***
final

class
==

See how with every new successive python main.py run, the gap between "name" and "class" of the previous writes gets wider and wider, top to bottom?
edit: there's a pattern to the blank lines. They are doubling. 2->4->8->16->32 in my files.

Here's my "file_a.txt" that I'm reading from:

==
***
name

class
==

There should only be one blank line..

"file_b.txt" is initially blank.

Now here's the python program that downloads and updates according to the Google Drive API:

drive_service = discovery.build('drive', 'v3', http=http)
file_id, name = search_file(drive_service, 10, "name contains 'test_file'")
print(file_id, name)

if verified_file(a, b):
    file_path = "file_b.txt"
    downloaded = download_file(drive_service, file_id, file_path)
    with open('file_a.txt', 'r') as myfile:
        data = myfile.read()
    with open("file_b.txt", "a+") as myfile:
        myfile.seek(0, 2)

        myfile.write(data)

    path_of_new_file = "file_b.txt" # the file to upload
    file_name_on_drive = "test_file" # the name to upload as
    if file_id:
        update_file(drive_service, file_id, path_of_new_file, file_name_on_drive)  

Where are these blank lines coming from?

edit: does it have anything to do with the mimetype?
In both my download, and upload, they are being targeted as 'text/plain':

def download_file(service, file_id, filepath):
    request = service.files().export_media(fileId=file_id, mimeType='text/plain')
    fh = io.BytesIO()
    downloader = MediaIoBaseDownload(fh, request)
    done = False
    while done is False:
        status, done = downloader.next_chunk()
        print("Download %d%%." % int(status.progress() * 100))
    with io.open(filepath,'wb') as f:
        fh.seek(0)
        f.write(fh.read())  

def update_file(service, file_id, path_of_new_file, file_name_on_drive):
    mimetype = 'text/plain'
    file_metadata = {'name': file_name_on_drive}
    media = MediaFileUpload(path_of_new_file,
                            mimetype=mimetype)
    file = service.files().update(fileId=file_id,
                                  body=file_metadata,
                                  media_body=media).execute()
    print('File ID: %s' % file.get('id'))

Edit:

Something tells me newlines aren't the issue either:

Here's how I updated the code to strip newlines:

with open('file_a.txt', 'r') as myfile:
    data = myfile.readlines()
    print("data,", data)

with open("file_b.txt", "a+", newline='') as f:
    readfile = f.readlines()

    print("read", readfile)

    list2 = [a for a in readfile if a != '\n']
    print("newread", list2)

    list2.extend(data)

    print("2", list2)
    for item in list2:
        f.write(item)

path_of_new_file = "file_b.txt" # the file to upload
file_name_on_drive = "test_file" # the name to upload as
if file_id:
    update_file(drive_service, file_id, path_of_new_file, file_name_on_drive)  

And Command line output during run to prove there arent rogue newlines right before updating:

Download 100%.
data, ['==\n', '***\n', 'final\n', '\n', 'class\n', '==']
read []
newread []
2 ['==\n', '***\n', 'final\n', '\n', 'class\n', '==']  

and still on my google drive, there will be blank lines that get wider with every update.

0条回答
登录 后发表回答