Pygithub: github.PaginatedList.PaginatedList totalCount return None

创建于 2016-07-02  ·  14评论  ·  资料来源: PyGithub/PyGithub

我的问题是为什么调用 totalCount 直接返回 None,但 iterate 会得到数字。

            repo_commits = repo.get_commits()
            repo_total = 0
            for _ in repo_commits:
                repo_total = repo_total + 1
            repo_total = repo.get_commits().totalCount

最有用的评论

对于要点:

    from github import Github
    gh = Github()
    gists = gh.get_user('gil9red').get_gists()
    print(gists.totalCount)  # None
    print(len(list(gists)))  # 7

所有14条评论

我粘贴我的代码

import json
import os

import github
from github import Github

# First create a Github instance:

g = Github("grapebaba", "heatonn1",per_page=1000)


def main():
    '''
    Use small data for this application
    :return:
    '''
    with open(os.path.join(os.path.expanduser("~"),'recruitbot_data.txt'),'w') as f:
        for user in g.search_users("type:user")[0:10000]:
            user_dict = {}
            user_dict['username'] = user.login
            user_dict['id'] = user.id
            user_dict['profile_url'] = user.html_url
            user_dict['location'] = user.location
            user_dict['followers'] = user.followers
            user_dict['private_gists'] = user.private_gists
            user_dict['public_gists'] = user.public_gists
            user_dict['name'] = user.name
            user_dict['company'] = user.company
            user_dict['blog_url'] = user.blog
            user_dict['email'] = user.email
            user_dict['id'] = user.id
            user_dict['contributions']={}
            for repo in user.get_watched():
                try:
                    repo_total = 0
                    for _ in repo.get_commits():
                        repo_total = repo_total + 1
                    if repo.get_stats_contributors() is not None:
                        for contributor in repo.get_stats_contributors():
                            if contributor is not None and contributor.author.id == user_dict['id']:
                                user_dict['contributions'][repo.name]={}
                                user_dict['contributions'][repo.name]['contributor_commits']=contributor.total
                                user_dict['contributions'][repo.name]['repo_commits']=repo_total
                                user_dict['contributions'][repo.name]['language']=repo.language
                                user_dict['contributions'][repo.name]['stars']=repo.stargazers_count
                                print user_dict
                                break
                except github.GithubException as e:
                    print e

            f.write(json.dumps(user_dict)+"\n")

if __name__ == '__main__':
    main()

我有另一个问题,有时我会收到 SSL 错误

Traceback (most recent call last):
  File "/tmp/collector.py", line 54, in <module>
    main()
  File "/tmp/collector.py", line 36, in main
    for _ in repo.get_commits():
  File "/usr/local/lib/python2.7/dist-packages/github/PaginatedList.py", line 48, in __iter__
    newElements = self._grow()
  File "/usr/local/lib/python2.7/dist-packages/github/PaginatedList.py", line 60, in _grow
    newElements = self._fetchNextPage()
  File "/usr/local/lib/python2.7/dist-packages/github/PaginatedList.py", line 161, in _fetchNextPage
    headers=self.__headers
  File "/usr/local/lib/python2.7/dist-packages/github/Requester.py", line 171, in requestJsonAndCheck
    return self.__check(*self.requestJson(verb, url, parameters, headers, input, cnx))
  File "/usr/local/lib/python2.7/dist-packages/github/Requester.py", line 212, in requestJson
    return self.__requestEncode(cnx, verb, url, parameters, headers, input, encode)
  File "/usr/local/lib/python2.7/dist-packages/github/Requester.py", line 251, in __requestEncode
    status, responseHeaders, output = self.__requestRaw(cnx, verb, url, requestHeaders, encoded_input)
  File "/usr/local/lib/python2.7/dist-packages/github/Requester.py", line 281, in __requestRaw
    output = response.read()
  File "/usr/lib/python2.7/httplib.py", line 557, in read
    s = self._safe_read(self.length)
  File "/usr/lib/python2.7/httplib.py", line 664, in _safe_read
    chunk = self.fp.read(min(amt, MAXAMOUNT))
  File "/usr/lib/python2.7/socket.py", line 380, in read
    data = self._sock.recv(left)
  File "/usr/lib/python2.7/ssl.py", line 341, in recv
    return self.read(buflen)
  File "/usr/lib/python2.7/ssl.py", line 260, in read
    return self._sslobj.read(len)

你好,
关于您的 totalCount 为 None,我对 repo.get_pulls().totalCount 也有相同的看法。
我认为问题在于返回的 JSON 不包含数据 ['total_count']。
也许分页列表应该实现__len__

if self.__totalCount:
    return self.__totalCount
else:
   return len(self.__elements)

@pgmillon听起来很公平。
如果你想起草一个这样做的小公关,我愿意合并它。

注意到同样的事情,PaginatedList 从不设置 totalCount。 返回元素的 len 不是我要找的,我想知道实际上有多少项目可以获取。

同意,但 AFAIK API 没有提供任何方式知道这一点。 所以唯一的方法是目前的解决方法:

opened_pulls = repository.get_pulls()
pulls_count = 0
# Fix no count available on pulls list
for _ in opened_pulls:
    pulls_count += 1

对于要点:

    from github import Github
    gh = Github()
    gists = gh.get_user('gil9red').get_gists()
    print(gists.totalCount)  # None
    print(len(list(gists)))  # 7

还是一样的问题:

from github import Github
g = Github()
repos = g.get_repos()
print(repos.totalCount) # None

@gil9red @Tigralt您是否试图获取迭代PaginatedList返回的项目总数? 如果是这样,如果不迭代 PaginatedList 并增加计数,就没有办法做到这一点。 请参阅此处的文档(即使它们很旧,它们也是正确的)

GitHub 无法知道分页请求将返回的项目数,因此 PaginatedList 没有长度:
...
如果您真的想获取分页列表的长度,则必须明确 [原文如此] 构造一个列表,然后使用其长度:

如果您尝试获取 PaginatedList 中一页上的项目数,看起来此 PR仍处于开放状态并正在讨论中。

看几个相关的问题 #433 #487 #596

我猜total_count来自旧的 Github API 响应? 我们应删除TotalCount的属性PaginatedList ,因为它的实现是坏,总是返回None 。 相反,我们可以在文档中添加一些内容,例如:

# To get the total number of available elements in PaginatedList
repos = g.get_user().get_repos()
print(len(list(repos)))  # we can't avoid to iterate through the whole set to get total count

也许我们可以实现__len__来返回当前元素计数?

我认为 Github API 将返回查询的总页数,因此如果您设置per_page=1您应该能够从单个请求中获取项目总数。 这可能比迭代(对于大型结果集)更有效。

@Tommos0我明白了,这可能是个好主意。 我们甚至可以使用HEAD而不是GET来检索Link标头。

此问题已自动标记为陈旧,因为它最近没有活动。 如果没有进一步的活动发生,它将被关闭。 感谢你的贡献。

已修复 #820

此页面是否有帮助?
0 / 5 - 0 等级

相关问题

hren-ron picture hren-ron  ·  6评论

psychemedia picture psychemedia  ·  5评论

Borkason picture Borkason  ·  4评论

AdyaAbhra picture AdyaAbhra  ·  5评论

jacquev6 picture jacquev6  ·  3评论