Pete Jones<p>Does anyone know whether the IMDb non-commercial datasets (<a href="https://developer.imdb.com/non-commercial-datasets/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">developer.imdb.com/non-commerc</span><span class="invisible">ial-datasets/</span></a>) include ALL the titles on IMDb?</p><p>They say that it's a "subset" of the full data, but it's unclear whether they mean a subset of the variables available for each title or a subset of the titles on IMDb.</p><p>Here are the counts per title type for data downloaded this week. Do these look plausible as the full counts? We're particularly interested in the movies.</p><p>The stuff you can find by searching online for "number of movies on IDMb" suggests that these counts are the totals, but I don't know how reliable those pages are.</p><p><a href="https://hcommons.social/tags/imdb" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>imdb</span></a> <a href="https://hcommons.social/tags/FilmMastodon" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FilmMastodon</span></a> <a href="https://hcommons.social/tags/digitalhumanities" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>digitalhumanities</span></a> <a href="https://hcommons.social/tags/datascience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datascience</span></a></p>