I have a DataFrame:
+-----+--------+---------+
| usn|log_type|item_code|
+-----+--------+---------+
| 0| 11| I0938|
| 916| 19| I0009|
| 916| 51| I1097|
| 916| 19| C0723|
| 916| 19| I0010|
| 916| 19| I0010|
|12331| 19| C0117|
|12331| 19| C0117|
|12331| 19| I0009|
|12331| 19| I0009|
|12331| 19| I0010|
|12838| 19| I1067|
|12838| 19| I1067|
|12838| 19| C1083|
|12838| 11| B0250|
|12838| 19| C1346|
+-----+--------+---------+
And I want distinct item_code
and make an index for each item_code
like this:
+---------+------+
|item_code| numId|
+---------+------+
| I0938| 0 |
| I0009| 1 |
| I1097| 2 |
| C0723| 3 |
| I0010| 4 |
| C0117| 5 |
| I1067| 6 |
| C1083| 7 |
| B0250| 8 |
| C1346| 9 |
+---------+------+
I don't use monotonically_increasing_id
because it returns a bigint.