Enhance API of captures() to enable retrieval of ALL groups at once, as a dictionary

Issue #86 resolved
Marcin Wojnarski created an issue

Hi,

For non-repeated groups, match.groupdict() returns a dictionary of ALL named groups and their values, including unmatched groups. There is no equivalent for repeated groups: match.captures() returns values only for the groups passed explicitly as arguments, while groupdict() holds a single value per group rather than all of its captures.

I suggest either:

  1. Change the API of captures() so that captures() (no args) returns a dictionary of ALL groups, not just group 0. This would be the most convenient and intuitive option, but it would break existing code that relies on the current behavior.

  2. Add a boolean argument to captures(), say "all", defaulting to False, to let the caller indicate that a full dictionary is expected.

  3. Add a new method, say capturesdict(), to return a dict of all groups.

Thanks
Marcin

What version of the product are you using? On what operating system?

0.1.20130120
Linux, Python 2.7.2

Comments (5)

  1. Former user Account Deleted

    Should the dict behave like this?

    # Map every named group to the list of all its captures.
    capturesdict = {}
    for name in m.groupdict():
        capturesdict[name] = m.captures(name)
    

    What's your use case? Could you provide some examples of the suggested feature?

  2. Marcin Wojnarski reporter

    Yes, it should behave this way.

    Use case: web scraping, i.e. extracting many different values from a complex HTML page in one go (for example, the profile page of a product, with different properties listed in a fixed layout). After applying a regex, the next step is to take *all* the extracted data as a dict, not one value at a time.

  3. Former user Account Deleted

    Could you provide some simple test cases?

    I think it'll be called 'capturesdict'.
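
The behaviour discussed in this thread could be sketched roughly as follows. This is only an illustration, not the library's implementation: captures_dict() and demo() are hypothetical names, the HTML string and pattern are made up, and the match object is assumed to come from the third-party regex module, which provides both groupdict() and captures(name).

```python
def captures_dict(match):
    """Return {group name: list of every capture} for all named groups.

    Hypothetical helper, not part of any library. `match` is assumed to be a
    match object from the third-party `regex` module, which provides both
    groupdict() and captures(name).
    """
    # groupdict() lists every named group, including groups that did not match.
    return {name: match.captures(name) for name in match.groupdict()}


def demo():
    """Scraping-style example; needs the third-party `regex` package."""
    import regex  # stdlib `re` match objects have no captures() method

    # Made-up input: one repeated group and one optional, unmatched group.
    html = "<li>red</li><li>green</li><li>blue</li>"
    m = regex.match(r"(?:<li>(?P<colour>\w+)</li>)+(?P<footer><hr>)?", html)

    print(m.groupdict())     # a single value per named group
    print(captures_dict(m))  # every value captured by each named group
```

Calling demo() prints both dictionaries side by side, which makes it easy to compare the single value groupdict() keeps for a repeated group with the full list of captures gathered by the helper.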
