web-scraping – 在哪里下载电影数据集?

web-scraping – 在哪里下载电影数据集?,第1张

概述我想在单个文件中下载包含电影名称和演员列表等基本信息的电影转储.我找了几个选项,比如 http://api.themoviedb.org/2.1/和 http://api.themoviedb.org/2.1/. TheMovieDB没有提供批量下载数据的选项. IMDB有数据,但它似乎分散在各个文件中.此外,我无法弄清楚如何拼接演员,电影名称等单独文件中的数据,因为它们似乎没有任何常用键.如果我 我想在单个文件中下载包含电影名称和演员列表等基本信息的电影转储.我找了几个选项,比如 http://api.themoviedb.org/2.1/和 http://api.themoviedb.org/2.1/. TheMovIEDB没有提供批量下载数据的选项. IMDB有数据,但它似乎分散在各个文件中.此外,我无法弄清楚如何拼接演员,电影名称等单独文件中的数据,因为它们似乎没有任何常用键.如果我在这里遗漏了一些东西,请告诉我.

有人可以让我知道如何下载电影数据集吗?

解决方法 您可以使用Freebase以JsON格式下载 movies和 actors.有关更多信息,请参见 API wiki.

例如,查询:

GET https://www.GoogleAPIs.com/freebase/v1/mqlread?query=[{%22type%22:%22/film/actor%22,%22ID%22:null,%22name%22:null}]

将返回:

{  "result": [{    "type": "/film/actor","ID": "/en/milla_jovovich","name": "Milla Jovovich"  },{    "type": "/film/actor","ID": "/en/angus_macfadyen","name": "Angus Macfadyen"  },"ID": "/en/aisha_tyler","name": "Aisha Tyler"  },"ID": "/en/stephen_dorff","name": "Stephen Dorff"  },"ID": "/en/vincent_laresca","name": "vincent Laresca"  },"ID": "/en/dawn_greenhalgh","name": "Dawn Greenhalgh"  },"ID": "/en/nola_augustson","name": "Nola Augustson"  },"ID": "/en/dudley_moore","name": "Dudley Moore"  },"ID": "/en/julIE_andrews","name": "JulIE Andrews"  },"ID": "/en/bo_derek","name": "Bo Derek"  },"ID": "/en/robert_webber","name": "Robert Webber"  },"ID": "/en/dee_wallace-stone","name": "Dee Wallace-Stone"  },"ID": "/en/ryan_phillippe","name": "Ryan Phillippe"  },"ID": "/en/salma_hayek","name": "Salma Hayek"  },"ID": "/en/neve_campbell","name": "Neve Campbell"  },"ID": "/en/mike_myers","name": "Mike Myers"  },"ID": "/en/satoshi_tsumabuki","name": "Satoshi Tsumabuki"  },"ID": "/en/masanobu_ando","name": "Masanobu Ando"  },"ID": "/en/davID_gahan","name": "Dave Gahan"  },"ID": "/en/martin_gore","name": "Martin Gore"  },"ID": "/en/andrew_fletcher_1961","name": "Andrew Fletcher"  },"ID": "/en/alan_wilder","name": "Alan Wilder"  },"ID": "/en/gerard_butler","name": "Gerard Butler"  },"ID": "/en/lena_headey","name": "Lena headey"  },"ID": "/en/davID_wenham","name": "DavID Wenham"  },"ID": "/en/robert_de_niro","name": "Robert De Niro"  },"ID": "/en/gerard_depardIEu","name": "G\u00e9rard DepardIEu"  },"ID": "/en/dominique_sanda","name": "Dominique Sanda"  },"ID": "/en/john_belushi","name": "John Belushi"  },"ID": "/en/ned_beatty","name": "Ned Beatty"  },"ID": "/en/dan_aykroyd","name": "Dan Aykroyd"  },"ID": "/en/lorraine_gary","name": "Lorraine Gary"  },"ID": "/en/murray_hamilton","name": "Murray Hamilton"  },"ID": "/en/robert_downey_jr","name": "Robert Downey Jr."  },"ID": "/en/kIEfer_sutherland","name": "KIEfer Sutherland"  },"ID": "/en/winona_ryder","name": "Winona Ryder"  },"ID": "/en/john_hurt","name": "John Hurt"  },"ID": "/en/richard_burton","name": "Richard Burton"  },"ID": "/en/suzanna_hamilton","name": "Suzanna Hamilton"  },"ID": "/en/cyril_cusack","name": "Cyril Cusack"  },"ID": "/en/gregor_fisher","name": "Gregor Fisher"  },"ID": "/en/tony_leung_chiu_wai","name": "Tony Leung Chiu Wai"  },"ID": "/en/gong_li","name": "Gong li"  },"ID": "/en/faye_wong","name": "Faye Wong"  },"ID": "/en/takuya_kimura","name": "Takuya Kimura"  },"ID": "/en/zhang_ziyi","name": "Zhang Ziyi"  },"ID": "/en/carina_lau","name": "Carina Lau"  },"ID": "/en/chang_chen","name": "Chang Chen"  },"ID": "/en/bird_mcintyre","name": "Bird McIntyre"  },"ID": "/en/maggIE_cheung","name": "MaggIE Cheung"  },"ID": "/en/chevy_chase","name": "Chevy Chase"  },"ID": "/en/steve_martin","name": "Steve Martin"  },"ID": "/en/martin_short","name": "Martin Short"  },"ID": "/en/joe_mantegna","name": "Joe Mantegna"  },"ID": "/en/jon_lovitz","name": "Jon lovitz"  },"ID": "/en/alfonso_arau","name": "Alfonso arau"  },"ID": "/en/tony_plana","name": "Tony Plana"  },"ID": "/en/al_pacino","name": "Al Pacino"  },"ID": "/en/carmen_maura","name": "Carmen Maura"  },"ID": "/en/luis_hostalot","name": "Luis Hostalot"  },"ID": "/en/veronica_forque","name": "Veronica Forqu\u00e9"  },"ID": "/en/hume_cronyn","name": "Hume Cronyn"  },"ID": "/en/jessica_tandy","name": "Jessica Tandy"  },"ID": "/en/frank_mcrae","name": "Frank McRae"  },"ID": "/en/elizabeth_pena","name": "Elizabeth Pe\u00f1a"  },"ID": "/en/dennis_boutsikaris","name": "Dennis Boutsikaris"  },"ID": "/en/hal_warren","name": "Hal Warren"  },"ID": "/en/tom_neyman","name": "Tom Neyman"  },"ID": "/en/john_reynolds_1941","name": "John Reynolds"  },"ID": "/en/rajnikanth","name": "Rajnikanth"  },"ID": "/en/srIDevi_kapoor","name": "SrIDevi Kapoor"  },"ID": "/en/kantimathi","name": "Kantimathi"  },"ID": "/en/konkona_sen_sharma","name": "Konkona Sen Sharma"  },"ID": "/en/shabana_azmi","name": "Shabana Azmi"  },"ID": "/en/soumitra_chatterjee","name": "Soumitra Chatterjee"  },"ID": "/en/waheeda_rehman","name": "Waheeda Rehman"  },"ID": "/en/rahul_bose","name": "Rahul Bose"  },"ID": "/en/william_hopper","name": "William Hopper"  },"ID": "/en/joan_taylor","name": "Joan Taylor"  },"ID": "/en/frank_puglia","name": "Frank Puglia"  },"ID": "/en/james_garner","name": "James Garner"  },"ID": "/en/rod_taylor_1930","name": "Rod Taylor"  },"ID": "/en/eva_marIE_saint","name": "Eva MarIE Saint"  },"ID": "/en/paul_walker","name": "Paul Walker"  },"ID": "/en/eva_mendes","name": "Eva Mendes"  },"ID": "/en/devon_aoki","name": "Devon Aoki"  },"ID": "/en/john_payne_1912","name": "John Payne"  },"ID": "/en/evelyn_keyes","name": "Evelyn Keyes"  },"ID": "/en/brad_dexter","name": "Brad Dexter"  },"ID": "/en/frank_faylen","name": "Frank Faylen"  },"ID": "/en/peggIE_castle","name": "PeggIE Castle"  },"ID": "/en/jean-hugues_anglade","name": "Jean-Hugues Anglade"  },"ID": "/en/beatrice_dalle","name": "B\u00e9atrice Dalle"  },"ID": "/en/vincent_lindon","name": "vincent lindon"  },"ID": "/en/dominique_pinon","name": "Dominique Pinon"  },"ID": "/en/joaquin_phoenix","name": "Joaquin Phoenix"  },"ID": "/en/james_gandolfini","name": "James Gandolfini"  },"ID": "/en/catherine_keener","name": "Catherine Keener"  },"ID": "/en/norman_reedus","name": "norman Reedus"  },"ID": "/en/dean_martin","name": "Dean Martin"  }]}

同样,你会这样做:

https://www.GoogleAPIs.com/freebase/v1/mqlread?query=[{%22type%22:%22/film/film%22,%22name%22:null}]

获取电影标题.

总结

以上是内存溢出为你收集整理的web-scraping – 在哪里下载电影数据集?全部内容,希望文章能够帮你解决web-scraping – 在哪里下载电影数据集?所遇到的程序开发问题。

如果觉得内存溢出网站内容还不错,欢迎将内存溢出网站推荐给程序员好友。

欢迎分享,转载请注明来源:内存溢出

原文地址:https://54852.com/web/1059159.html

(0)
打赏 微信扫一扫微信扫一扫 支付宝扫一扫支付宝扫一扫
上一篇 2022-05-25
下一篇2022-05-25

发表评论

登录后才能评论

评论列表(0条)

    保存