Big data entity resolution in NoSQL database

¥240-2000 CNY

Closed
Posted almost 7 years ago

Paid on delivery
Several collections store documents containing company entity information. These collections record different kinds of information about company entities, such as executives, accountants, products, and investments. Depending on the type of information stored, the collections differ in field names and structure, but they also overlap in certain fields, such as company name, geolocation, contact info, industry keywords, and official website.

We now need to link documents about the same entity across all of these collections. The obstacles we have encountered are:

1. Because the data comes from different sources, the name of the same company appears differently across collections. Some names are written in full, some are abbreviated, some are in Pinyin, and some are simply the initials of the English name, so it is hard to match all documents about the same entity.
2. Different collections contain different fields, and not all collections have contact information and website fields. The company name may be the only field common to all collections, so it is hard to establish a unified matching rule.

If we use the Apache Spark framework to solve this entity resolution problem, which algorithms offer the best performance in terms of precision and feasibility? The largest collection contains around 20,000,000 documents.

We need to find an outsourced specialist who has:

1. Project experience with big data entity resolution in a NoSQL database
2. Over two years of experience with Apache Spark and MongoDB
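
To make the name-matching obstacle concrete, here is a minimal sketch of one common approach: normalize names, group them by a cheap blocking key, and compare only within a group. It is plain Python for illustration, not a Spark job and not a recommendation from the posting. The suffix list, the first-letter blocking key, the 0.9 score for initialisms, the 0.85 threshold, and the record IDs are all assumptions chosen for the example, and it assumes names are already romanized (Pinyin conversion and Chinese-character names are not handled).

```python
import re
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations

# Hypothetical legal-form suffixes to strip; a real list would be much longer.
SUFFIXES = {"co", "ltd", "inc", "corp", "corporation", "company", "limited", "group"}


def normalize(name):
    """Lowercase, drop punctuation, and remove common legal-form suffixes."""
    tokens = re.findall(r"[a-z0-9]+", name.lower())
    return [t for t in tokens if t not in SUFFIXES]


def initials(tokens):
    """First letter of each token, e.g. ['alpha', 'beta'] -> 'ab'."""
    return "".join(t[0] for t in tokens)


def blocking_key(tokens):
    """Cheap key so that only names sharing a first letter are compared."""
    return tokens[0][0] if tokens else ""


def similarity(a, b):
    """Score two raw names between 0.0 and 1.0."""
    ta, tb = normalize(a), normalize(b)
    if not ta or not tb:
        return 0.0
    ja, jb = "".join(ta), "".join(tb)
    # Assumed rule: a name that equals the initials of the other is a strong match.
    if (len(ta) > 1 and jb == initials(ta)) or (len(tb) > 1 and ja == initials(tb)):
        return 0.9
    return SequenceMatcher(None, " ".join(ta), " ".join(tb)).ratio()


def candidate_pairs(records, threshold=0.85):
    """records: iterable of (record_id, company_name) from any collection."""
    blocks = defaultdict(list)
    for rec_id, name in records:
        blocks[blocking_key(normalize(name))].append((rec_id, name))
    matches = []
    for block in blocks.values():
        # Pairwise comparison happens only inside a block, never across blocks.
        for (id_a, name_a), (id_b, name_b) in combinations(block, 2):
            score = similarity(name_a, name_b)
            if score >= threshold:
                matches.append((id_a, id_b, round(score, 2)))
    return matches


if __name__ == "__main__":
    sample = [
        ("exec:1", "Alpha Beta Consulting Co., Ltd."),
        ("acct:7", "ABC"),
        ("prod:3", "Alpha Beta Consulting"),
        ("inv:9", "Zeta Holdings"),
    ]
    for pair in candidate_pairs(sample):
        print(pair)
```

Blocking is what keeps the comparison count manageable at the 20,000,000-document scale mentioned above, since comparing every pair of names directly would be infeasible. In a distributed setting the same idea would typically be expressed as grouping or joining on the blocking key, with a more selective key than a single letter.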
Project ID: 14290277

About the project

5 proposals
Remote project
Active 7 years ago

5 freelancers are bidding an average of ¥1,195 CNY for this job
Hello. Good to see another serious posting. I don't usually look for new clients, but I happened to see your job post and wanted to contact you. I've read your brief and I could absolutely help you with your goal. I have 10+ years of experience designing and developing mobile apps for iPhone and Android and building websites, so we can make your idea a success. I would approach your project by starting with wireframes and getting the design completed before starting the actual development phase. I am highly qualified for this project and would love to speak with you further about taking it on. If you'd like to view my previous work, take a look at my Freelancer Portfolio. I hope you will contact me on chat. Thank you for taking the time to read my application. Cheers, Lang
¥1,244 CNY in 3 days
5.0 (1 review)
3.2
Hey, we are a team of technical developers with expertise in this kind of work. Ping me if you are looking for a quick resolution.
¥1,248 CNY in 7 days
0.0 (0 reviews)
0.0
Hi, this is Fatima. I have been researching and have found two native Spark solutions for your problem, plus Duke. It will work. Best regards.

Relevant Skills and Experience
I have been working with Spark and Scala for 3 years.

Proposed Milestones
¥200 CNY - Analysis
¥800 CNY - Test run, small data set
¥800 CNY - Test run, big data set
¥200 CNY - Project finished

Additional Services Offered
¥200 CNY - Program maintenance

Would you like an expandable solution? Hire me.
¥2,000 CNY in 14 days
0.0 (0 reviews)
0.0

About this client

China
0.0
0
Member since Jun 9, 2017

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)