Find Jobs
Hire Freelancers

Develop a Photo Clustering System

£250-750 GBP

En curso
Publicado hace alrededor de 11 años

£250-750 GBP

Pagado a la entrega
The requirement is to build a process/pipeline that can take a table (literately a database table) of information about geographically located photos, and place them into meaningful but subjective groups or clusters. There are many 'dimensions' to the data that could be used to perform the clustering, including geographical coordinates, locality (town/country etc), date taken, textual tags (Folksonomy), and photographer. There is also freeform title and description, but we've already extracted automated terms from these, so don't need to process freeform text. All of these should/could be used to perform the clustering, eg "taken by Joe blogs in April 2012" could be a arbitrary cluster. Clustering should ideally make use of the geographical coordinates, to create clusters of nearby photos (which have some other theme - such as taken by a particular user), but not limited to it, where possible multiple dimensions should be used. The photographer is a good candidate for clustering because often a given photographer will take similar photos in the same geographical area on any given day. It will require two modes, 1) 'priming' where a large number (over 3 million ultimately!) of photos are taken and put into clusters. and 2) 'updates' where batches of images are added (about 1000 at a time), which require placing into the existing clusters or creating new ones. The 'update' mode should aim to where possible add to current clusters , it could delete and then recreate some clusters if how have a better fit, but also needs to be able to create new clusters where needed. In particular, it should be differential, most clusters will remain the same, only a few changing, it shouldn't just delete all the clusters and start again. The two modes are closely related, and will be largely similar probably (eg priming could just be lots of 'updates' with initially no clusters, but there could be some optimization possible to tailor for the two modes. The aim would be to have every photo placed in one or more cluster, and ideally clusters should be somewhere on the order of 5-200 images. If a cluster grows much beyond 200 it should be a candidate for splitting. Ideally each cluster should have a label that describes it eg "photos near Reading" If K-means or similar is used to cluster geographically, it should be an adaptive algorithm, without having to specify K. ie it works out a good number of clusters to create, not aim to create say 30 clusters. [login to view URL]~wilkinson/Applets/[login to view URL] A sample dataset can be supplied (say a table of 120,000 images), but the 'full' data set of 3.4M images could be used too. For a tiny sample, showing the range of columns available, see [login to view URL] It can be written in any language (PHP, Python, Java etc), but needs to be able to run fairly self contained on a Linux server. MySQL would be the ideal backing database (downloading the data from mysql, and creating the clusters in a mysql table) - but others can be considered if offer a tangible benefit (eg postgre/postgis). The full source code - and the means to compile/run it will be required. The eventual aim would be to release the source as opensource. (keep the credit yourself, or assign it to us) To be clear the requirement is not to come up with the perfect clustering system, as noted the clusters are subjective. But to build the framework - with a working clustering method - but so that the exact parameters can be tweaked as required.
ID del proyecto: 4405177

Información sobre el proyecto

10 propuestas
Proyecto remoto
Activo hace 11 años

¿Buscas ganar dinero?

Beneficios de presentar ofertas en Freelancer

Fija tu plazo y presupuesto
Cobra por tu trabajo
Describe tu propuesta
Es gratis registrarse y presentar ofertas en los trabajos
Adjudicado a:
Avatar del usuario
Hi I'm a statistician form Uruguay and I have plenty of experience in data mining and data analysis
£935 GBP en 30 días
5,0 (1 comentario)
2,1
2,1
10 freelancers están ofertando un promedio de £734 GBP por este trabajo
Avatar del usuario
Hi, I can help you.
£735 GBP en 3 días
5,0 (1 comentario)
2,7
2,7
Avatar del usuario
Hello, greatly interested in serving for you, any difficulies will be no problem for me in image clustering.
£750 GBP en 13 días
3,0 (1 comentario)
2,8
2,8
Avatar del usuario
Hello, We specialize in Image clustering and will be able to complete the task as per your specifications. Please find complete details over PM. We hope to hear from you at the earliest.
£770 GBP en 20 días
5,0 (1 comentario)
2,4
2,4
Avatar del usuario
I can do this task for you, please check PMB.
£525 GBP en 15 días
5,0 (3 comentarios)
2,1
2,1
Avatar del usuario
I can help you in your project.
£400 GBP en 25 días
5,0 (1 comentario)
1,0
1,0
Avatar del usuario
hire me...!!!! i have done master in cs and working in hcl for past 5 years i can do this you you in 30 day
£750 GBP en 30 días
0,0 (0 comentarios)
0,0
0,0
Avatar del usuario
We have developers with skills required to do this project and can provide you best solution in php or python.
£550 GBP en 25 días
0,0 (0 comentarios)
0,0
0,0
Avatar del usuario
Hi Barry, I run a company specializing in Machine Learning and Social Media Analysis. I have submitted a detailed approach in my private message. Thank you.
£1.100 GBP en 30 días
0,0 (0 comentarios)
0,0
0,0

Sobre este cliente

Bandera de UNITED KINGDOM
Ffestiniog, United Kingdom
5,0
3
Forma de pago verificada
Miembro desde feb 15, 2004

Verificación del cliente

¡Gracias! Te hemos enviado un enlace para reclamar tu crédito gratuito.
Algo salió mal al enviar tu correo electrónico. Por favor, intenta de nuevo.
Usuarios registrados Total de empleos publicados
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Cargando visualización previa
Permiso concedido para Geolocalización.
Tu sesión de acceso ha expirado y has sido desconectado. Por favor, inica sesión nuevamente.