Data analysis project
$250-750 USD
Pagado a la entrega
I need data analysis project by python.
Let's assume that we would like to collect labels for each column as Organization, Person, Address and Other from 250,000 different datasets. For instance, different column names such as vendor_name, business, name, corporation and parent_company can be used to represent Organization and it becomes difficult to label each column manually when you have a large number of datasets. Explain your ideas and methods to efficiently obtain labels in as much detail as possible. After that I will award you.
Nº del proyecto: #21229973
Sobre el proyecto
26 freelancers están ofertando un promedio de $523 por este trabajo
Hello, I have gone through your job posting and become very much interested to work with you. I am an expert in this field. I have already completed several projects like this. For evidence you can see my profile. Pl Más
I am a Data Scientist with 3+ years of experience in Data Analysis, Statistical Modelling, Machine Learning, Deep Learning, Computer Vision and Natural Language Processing. I have worked across various domains such as Más
Dear sir. Your project attracted my attention at first glance, because I've extensive experience in Data Analysis Programming. I'm really confident about your project, and very eager to join your project. If we have a Más
HI, I am data scientist and have good experience in python and R programming. My area of interest is statistical Analysis of dataset and apply ML/deep learning algorithm. I can intern your tasks. Kind Regards
Hi, I can help you get this done. I have skills in Python, Data Processing, Machine Learning (ML), Data Mining, Statistical Analysis
Hi, First of all, your explanation was not very clear to me. Do we need to categorize the column label or something else? I am not able to understand your explanation completely. Please share more detail as I here Más
Hello sir. I'm excited about your project, because I've really rich experience in Data Analysis Programming. I've developed many projects similar to yours and excellent skills. If you award me, I'll provide wonderfu Más
hi, I'm a professional statistical analyst seeking opportunity to provide highest quality services in the following areas of Statistics and Econometric. Looking for outstanding opportunities to apply my academic creden Más
Hello, I've read your project requirements thoroughly. The most possible solution for your problem would be to collect all the keywords (column names) through a program then filter them out for unique values. After th Más
It is a job access can do it. Just simple filtration of Variable, if your data is stored. I can do your project efficiently
Hello, For what I have understand is you need to automate the process of labeling the columns from around 250000 datasets into the finite columns like organization , etc. I would propose to create a dictionary for ever Más
My approach would be to: 1. Collect all column names from the different data sources 2. Apply basic text processing steps and regular expression methods like removal of special characters and stopwords, treatment of st Más
My preferred method of freelancing is an interactive approach to project solving. I have an MSEE specializing in Digital Signal/Image/RF Processing. I do my work in MATLAB (expert). I also do Python programming.
Data cleaning & extraction can be done various pattern matching and regrex based on dataset given. Custom algorithm to get more effiecient extraction based on given input All models will be coded in python, so all ma Más
That's Simple , NER - Named Entity Recognition with SpaCy or NLTK , will do your task . Process will involve from reading column names to categorizing - with tokenizing , chunking , etc . to your datasets , actually wh Más
0/ Make data mining based on your datasets 1/ Create a list of words from all datasets using bag of words 2/ display those words on the screen 3/ use the list of stop words and create rules that will be used for separ Más