make two tables having same orders in both column and row names

Simon Forman sajmikins at gmail.com
Thu Nov 19 17:48:11 EST 2009


On Wed, Nov 18, 2009 at 3:57 PM, Ping-Hsun Hsieh <hsiehp at ohsu.edu> wrote:
> Hi,
>
> I would like to compare values in two table with same column and row names, but with different orders in column and row names.
> For example, table_A in a file looks like the follows:
> AA100   AA109   AA101   AA103   AA102
> BB1     2       9       2.3     1       28
> BB3     12      9       2.3     1       28
> BB9     0.5     2       2.3     1       28
> BB2     2       9       21      1       20
>
> Table_B in the other file looks like the follows:
> AA101   AA109   AA100   AA103   AA102
> BB1     2       9       2.3     2       28
> BB2     2       9       2.3     1       28
> BB9     2       9       2.3     1       28
> BB3     2       2       2       1       28
>
> Can anyone give an efficient way to make the two tables having same orders in column and row names so I can easily and correctly compare the values in positions?
>
> Thanks,
> PingHsun

Depending on the kind of comparisons you want to do and the sizes of
your input files you could build a dict of dicts.
For example:


def generate_dict_values(F):
    column_names = f.readline().split()
    for line in F:
        row = line.split()
        yield row[0], dict(zip(column_names, row[1:]))


# Fake a file.
from StringIO import StringIO
f = StringIO('''\
AA100   AA109   AA101   AA103   AA102
BB1     2       9       2.3     1       28
BB3     12      9       2.3     1       28
BB9     0.5     2       2.3     1       28
BB2     2       9       21      1       20
''')


d = dict(generate_dict_values(f))

# Now you can access the values without regards to the order in the
file like so:

d['BB9']['AA109']



# "Pretty print" the dict of dicts.
from pprint import pprint
pprint(d)

# Prints:
#
#    {'BB1': {'AA100': '2',
#             'AA101': '2.3',
#             'AA102': '28',
#             'AA103': '1',
#             'AA109': '9'},
#     'BB2': {'AA100': '2',
#             'AA101': '21',
#             'AA102': '20',
#             'AA103': '1',
#             'AA109': '9'},
#     'BB3': {'AA100': '12',
#             'AA101': '2.3',
#             'AA102': '28',
#             'AA103': '1',
#             'AA109': '9'},
#     'BB9': {'AA100': '0.5',
#             'AA101': '2.3',
#             'AA102': '28',
#             'AA103': '1',
#             'AA109': '2'}}



More information about the Python-list mailing list