Python pandas dataframe to dictionary

Question

I have a two-column dataframe and I want to convert it to a Python dictionary: the first column will be the key, and the second will be the value. Thank you in advance.

Dataframe:

    id  value
 0   0   10.2
 1   1    5.7
 2   2    7.4

python dictionary pandas

perigee · Sep 09 '13 at 9:49

5 answers

joris · Answer 1 · 2013-09-09 09:55

See the docs for to_dict. You can use it like this:

 df.set_index('id').to_dict()

And if you have only one value column, you can select it first so that the column name does not also become a level in the dict (in this case you are actually using Series.to_dict()):

 df.set_index('id')['value'].to_dict()
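To make the difference between the two calls concrete, here is a minimal sketch. It assumes the frame from the question is rebuilt by hand under the name df, and that to_dict() is called with its default orientation.

```python
import pandas as pd

# The frame from the question, rebuilt by hand (an assumption about its dtypes).
df = pd.DataFrame({"id": [0, 1, 2], "value": [10.2, 5.7, 7.4]})

# DataFrame.to_dict(): the column name becomes the outer level of the dict.
nested = df.set_index("id").to_dict()

# Series.to_dict(): selecting the column first gives a flat id -> value mapping.
flat = df.set_index("id")["value"].to_dict()

print(nested)
print(flat)
```

Here nested has the single key 'value' wrapping the id-to-value mapping, while flat is that inner mapping on its own.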

DSM · Answer 2 · 2014-06-23 16:08

If you want an easy way to keep duplicates, you can use groupby :

 >>> ptest = pd.DataFrame([['a',1],['a',2],['b',3]], columns=['id', 'value'])
 >>> ptest
   id  value
 0  a      1
 1  a      2
 2  b      3
 >>> {k: g["value"].tolist() for k,g in ptest.groupby("id")}
 {'a': [1, 2], 'b': [3]}
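A variation on the same groupby idea, in case you prefer to stay inside pandas instead of writing a comprehension. This is only a sketch, not something from the answer above: it collects each group's values into a list with apply(list) and then converts the resulting Series to a dict.

```python
import pandas as pd

ptest = pd.DataFrame([["a", 1], ["a", 2], ["b", 3]], columns=["id", "value"])

# Group by the key column, turn each group's values into a list,
# then convert the resulting Series (indexed by id) into a dict.
grouped = ptest.groupby("id")["value"].apply(list).to_dict()

print(grouped)
```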

praful gupta · Answer 3 · 2016-10-03 17:41

 mydict = dict(zip(df.id, df.value))
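For completeness, a small runnable version of the one-liner, assuming the frame from the question is rebuilt by hand as df. The sketch uses bracket access (df["id"]) instead of attribute access, because attribute access stops working when a column name collides with an existing DataFrame attribute or method.

```python
import pandas as pd

# The frame from the question, rebuilt by hand.
df = pd.DataFrame({"id": [0, 1, 2], "value": [10.2, 5.7, 7.4]})

# Pair the two columns element by element and build the dict from the pairs.
mydict = dict(zip(df["id"], df["value"]))

print(mydict)
```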

dalloliogm · Answer 4 · 2014-06-23 14:35

The answers by joris in this thread and by punchagan in the duplicate question are very elegant; however, they will not give correct results if the column used for the keys contains any duplicate values.

For example:

 >>> ptest = pd.DataFrame([['a',1],['a',2],['b',3]], columns=['id', 'value'])
 >>> ptest
   id  value
 0  a      1
 1  a      2
 2  b      3
 # note that in both cases the association a->1 is lost:
 >>> ptest.set_index('id')['value'].to_dict()
 {'a': 2, 'b': 3}
 >>> dict(zip(ptest.id, ptest.value))
 {'a': 2, 'b': 3}

If you have duplicate entries and don’t want to lose them, you can use this ugly but working code:

 >>> mydict = {}
 >>> for x in range(len(ptest)):
 ...     currentid = ptest.iloc[x,0]
 ...     currentvalue = ptest.iloc[x,1]
 ...     mydict.setdefault(currentid, [])
 ...     mydict[currentid].append(currentvalue)
 >>> mydict
 {'a': [1, 2], 'b': [3]}
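If the index-based loop feels clumsy, the same accumulation can probably be written with collections.defaultdict and zip. This is a sketch of an alternative, not the code from the answer; it assumes the key column is named id and the value column is named value, as in the example.

```python
from collections import defaultdict

import pandas as pd

ptest = pd.DataFrame([["a", 1], ["a", 2], ["b", 3]], columns=["id", "value"])

# defaultdict(list) creates an empty list the first time a key is seen,
# so no setdefault call is needed.
collected = defaultdict(list)
for key, value in zip(ptest["id"], ptest["value"]):
    collected[key].append(value)

# Convert back to a plain dict once the accumulation is done.
mydict = dict(collected)

print(mydict)
```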

user1376377 · Answer 5 · 2017-10-23 16:29

Another (slightly shorter) solution that does not lose duplicate entries:

 >>> ptest = pd.DataFrame([['a',1],['a',2],['b',3]], columns=['id','value'])
 >>> ptest
   id  value
 0  a      1
 1  a      2
 2  b      3
 >>> pdict = dict()
 >>> for i in ptest['id'].unique().tolist():
 ...     ptest_slice = ptest[ptest['id'] == i]
 ...     pdict[i] = ptest_slice['value'].tolist()
 ...
 >>> pdict
 {'b': [3], 'a': [1, 2]}
