We have an existing web analytics platform that needs some bug fixes. It is a Scala app that reads events from Kafka and writes them to HDFS for consumption via Hive/Presto. The app has some small bugs:
* some events are duplicated
* write failures to HDFS result in dropped events
* other minor optimizations
-------------------------------------------------------------------------------------------------
Please describe your Scala and Spark experience.
Hi,
I am a certified big data developer and have used Spark and Scala in most of our applications.
I have used Spark Streaming with Kafka to read data from Kafka topics and save it to HDFS.
Let’s connect and discuss your requirements in more detail.
Thanks,
Naresh.
Hi, I've been working with Hive, Kafka, Spark, and Scala for 6 years and currently work for a top US-based retail chain.
I hope I can help you stabilize your product and resolve your issues as soon as possible.
Best wishes.
Hi,
I'm an experienced big data developer, proficient with the complete big data stack. I have good knowledge of Scala and Spark, including its stream processing: reading data from Kafka topics, processing it in Spark, then writing the results to HDFS.
Please go through my profile for evidence of my big data skills and quality of delivery.
Let's connect over chat to discuss more on this.
Looking forward to working with you.
Regards,
Vinod
Hi,
I have experience with Spark Kafka streaming, offset management, handling topic partitions, and Spark RDD transformations.
I can help resolve your issues.
Please reach out to me.
Thanks,
Mohanakrishnan