作者:智勇双全882602900857_984 | 来源:互联网 | 2023-09-18 16:03
IhavethefollowinginmyGooglecloudstorage我在Google云存储中有以下内容Advertiser|Event______________
I have the following in my Google cloud storage
我在Google云存储中有以下内容
Advertiser | Event
__________________
100 | Click
101 | Impression
100 | Impression
100 | Impression
101 | Impression
My output of the pipeline should be something like
我的输出管道应该是这样的
Advertiser | Count
100 | 3
101 | 2
First I used groupByKey, the output is like
首先我使用了groupByKey,输出就像
100 Click, Impression, Impression
101 Impression, Impression
How to proceed from here?
怎么从这里开始?
2 个解决方案