comparison Code/taste_profile_cleaning.py @ 42:a4dfbc53a162

Delete unnecesary spaces
author Paulo Chiliguano <p.e.chiilguano@se14.qmul.ac.uk>
date Mon, 21 Nov 2016 00:08:16 -0500
parents 1de207a22c1a
children
comparison
equal deleted inserted replaced
41:1de207a22c1a 42:a4dfbc53a162
39 def delete_triplets(zippedfile='train_triplets.txt.zip', 39 def delete_triplets(zippedfile='train_triplets.txt.zip',
40 mismatchesfile='sid_mismatches.txt'): 40 mismatchesfile='sid_mismatches.txt'):
41 """ 41 """
42 Delete triplets with songIDs mismatches and unavailable audio clips from 42 Delete triplets with songIDs mismatches and unavailable audio clips from
43 7Digital (UK) 43 7Digital (UK)
44 44
45 This is applied on Taste Profile subset. 45 This is applied on Taste Profile subset.
46 46
47 :type zippedfile: string 47 :type zippedfile: string
48 :param zippedfile: filename of the downloaded subset 48 :param zippedfile: filename of the downloaded subset
49 49
50 :type mismatchesfile: string 50 :type mismatchesfile: string
51 :param mismatchesfile: filename of the downloaded list of mismatches 51 :param mismatchesfile: filename of the downloaded list of mismatches
52 52
53 """ 53 """
54 tripletsfile = unzip_tasteprofile(zippedfile) 54 tripletsfile = unzip_tasteprofile(zippedfile)
55 mismatches = read_songid_mismatches(mismatchesfile) 55 mismatches = read_songid_mismatches(mismatchesfile)
56 print("There are %d songId-trackId mismatches." % len(mismatches)) 56 print("There are %d songId-trackId mismatches." % len(mismatches))
57 availableClips = read_available_songid('7digital/CF_dataset_7digital.txt') 57 availableClips = read_available_songid('7digital/CF_dataset_7digital.txt')