Chronicle Of A Data Scientist/analyst - Programming (47) - Nairaland
| Re: Chronicle Of A Data Scientist/analyst by yemyke001(m): 4:51pm On Jun 09, 2020 |
KunSegzy100:I will be glad if you can share the site where you saw the opening. My mail is yemyke@gmail.com. I will be even more glad if you can remove your vital info from your CV and send the bare template to my mail. Thank you in anticipation. |
| Re: Chronicle Of A Data Scientist/analyst by yemyke001(m): 5:05pm On Jun 09, 2020 |
I will be glad if you can help with a template of your CV that I can model mine on, and with good sites to search for openings. Thanks |
| Re: Chronicle Of A Data Scientist/analyst by yemyke001(m): 5:06pm On Jun 09, 2020 |
mcemmy0z:I will be glad if you can help with a template of your CV that I can model mine on, and with good sites to search for openings. Thanks |
| Re: Chronicle Of A Data Scientist/analyst by Boxers02: 5:23pm On Jun 09, 2020*. Modified: 1:36pm On Jun 14, 2020 |
| Re: Chronicle Of A Data Scientist/analyst by mcemmy0z: 6:46pm On Jun 09, 2020 |
yemyke001:Download the Indeed app from the Google Play Store, register as a data analyst, and wait for data analyst job alerts in your mail every day. |
| Re: Chronicle Of A Data Scientist/analyst by yemyke001(m): 8:02pm On Jun 09, 2020 |
mcemmy0z:Thanks Bro |
| Re: Chronicle Of A Data Scientist/analyst by Gcool2(m): 8:10pm On Jun 09, 2020 |
mcemmy0z:Please, I am just a learner and curious about something. You indicated that the guy should store his code somewhere. Do people have particular code they work with? I believe code depends on the task given, so how will he know what task he will get and be sure that his code will work? Please enlighten me more; I want to understand the part where you said "store your code". |
| Re: Chronicle Of A Data Scientist/analyst by Nobody: 8:33pm On Jun 09, 2020 |
Boxers02:are they udemy videos ? |
| Re: Chronicle Of A Data Scientist/analyst by mcemmy0z: 9:33pm On Jun 09, 2020 |
Gcool2:If you get familiar enough with Python for data preprocessing, you will realize that most of the steps are the same: importing the necessary libraries, importing your data, checking the head, doing some descriptive analysis, removing NaN values or dropping unwanted columns, melting columns, renaming columns. There is separate code for each of these, but I have saved everything somewhere because I don't have time to write it from scratch; I just change the data name to the one I am using. I don't even know when I last typed that code by hand. It is the same with machine learning, both supervised and unsupervised, whether I need to create dummy variables or scale the data. I have saved the code for each machine learning model too, ranging from linear and logistic regression to KNN, decision trees, etc. Even with visualization libraries like seaborn, there are some crazy plots whose code I can't memorize, so I save them all, along with the code for different lines and colors. Working smart is what it takes.
|
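The routine steps listed in the post above (check the head, describe, drop NaN rows and unwanted columns, rename, melt, create dummy variables) could be kept in a reusable snippet along these lines. This is only a minimal sketch assuming pandas is installed; the helper name `basic_clean`, the column names, and the toy data are hypothetical and are not the poster's actual saved code.

```python
import pandas as pd


def basic_clean(df, drop_cols=(), rename_map=None):
    """Reusable cleaning steps: drop columns, drop rows with NaN, rename columns."""
    cleaned = df.drop(columns=list(drop_cols))
    cleaned = cleaned.dropna()
    if rename_map:
        cleaned = cleaned.rename(columns=rename_map)
    return cleaned.reset_index(drop=True)


# Hypothetical toy data; in practice this would come from pd.read_csv(...)
raw = pd.DataFrame({
    "id": [1, 2, 3, 4],
    "city": ["Lagos", "Abuja", None, "Ibadan"],
    "q1_sales": [10.0, 20.0, 30.0, 40.0],
    "q2_sales": [15.0, 25.0, 35.0, 45.0],
    "notes": ["a", "b", "c", "d"],
})

print(raw.head())       # check the first rows
print(raw.describe())   # descriptive statistics for the numeric columns

# Only the arguments change from one data set to the next
clean = basic_clean(raw, drop_cols=["notes"], rename_map={"city": "location"})

# Melt the two quarterly columns into long format
long_df = clean.melt(
    id_vars=["id", "location"],
    value_vars=["q1_sales", "q2_sales"],
    var_name="quarter",
    value_name="sales",
)

# Dummy variables for a categorical column
dummies = pd.get_dummies(clean["location"], prefix="loc")
```

With a helper like this saved in one place, only the file name and the column arguments need changing for a new data set, which is the "rename the data name" idea from the post.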
| Re: Chronicle Of A Data Scientist/analyst by Boxers02: 10:03pm On Jun 09, 2020 |
Electricboy:Yes they are |
| Re: Chronicle Of A Data Scientist/analyst by Gcool2(m): 10:13pm On Jun 09, 2020 |
mcemmy0z:Thanks for this explicit response. Please check your mail. |
| Re: Chronicle Of A Data Scientist/analyst by Theflint1(m): 12:44pm On Jun 10, 2020 |
Macsjebs check the visualization for top front-page posters. cochtrane: |
| Re: Chronicle Of A Data Scientist/analyst by Grandlord: 1:02pm On Jun 10, 2020 |
Saw this somewhere and I thought it might help someone on here organize their learning process. You're welcome!
|
| Re: Chronicle Of A Data Scientist/analyst by cochtrane(m): 1:15pm On Jun 10, 2020 |
cochtrane:Following up on this dataset, I started wondering: can one define a machine learning question with this data set, given its limited number of features? Apparently yes! In this second part, I examine procedures for fitting a model with this data set. The research question takes the form: given a post title, can one tell which section it is from? For example, given the title "COVID-19: Governor Ikpeazu's Two Aides Test Positive", can our model tell that it is from the Health section?
Using NLP procedures, one can design a machine learning model that is trained on part of this data set and then tested on the remaining titles. This is a typical supervised learning design known as classification. This particular task is multi-class classification with about 37 classes (all sections on Nairaland). It is a little harder than binary classification, which has just two labels, because there are many more labels and the chance of being right on any one prediction by guessing is quite low (1/37 in this case, if one guesses uniformly at random).
Before fitting, I generated a wordcloud to see which words are the most prominent. Apparently, "buhari" has been a prominent word over the past year on Nairaland's front page. Little wonder its frequency was quite high in the initial analysis I did. "lagos" is also prominent and, unsurprisingly, so is "coronavirus".
SVM with a linear kernel was used for the classification task, and it worked quite well. I ended up with an overall accuracy of about 69%. For some specific keywords, the accuracy was even higher. For example, for the keyword "buhari", the model placed the front-page topic in "Politics" all the time and was correct 96% of the time. For the keyword "rape", it had a choice of three different sections and still managed an accuracy of 84%. For the keyword "coronavirus", it didn't do so well and managed an accuracy of only 69%.
In any case, it shows that some of these predictions are possible. One can probably improve this model by training it on more features such as number of posts, post author, and time of post. More features may improve its accuracy. I may get around to that if I've got more time. If you are able to do it, drop a message.
|
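To make the accuracy figures in the post above concrete, here is a minimal Python sketch of how an overall accuracy, a per-keyword accuracy, and the 1/37 random-guess baseline can be computed. The original analysis is in R, so this is not the poster's code; the helper names `accuracy` and `keyword_accuracy` are hypothetical, and the (title, true section, predicted section) triples are invented for illustration and are not the reported results.

```python
def accuracy(records):
    """Fraction of records whose predicted section equals the true section."""
    if not records:
        return float("nan")
    correct = sum(1 for _, true, pred in records if true == pred)
    return correct / len(records)


def keyword_accuracy(records, keyword):
    """Accuracy restricted to titles containing the keyword (case-insensitive)."""
    subset = [r for r in records if keyword.lower() in r[0].lower()]
    return accuracy(subset)


# Hypothetical (title, true_section, predicted_section) triples, not real results
records = [
    ("Buhari Signs New Bill", "Politics", "Politics"),
    ("Buhari Meets Governors", "Politics", "Politics"),
    ("Coronavirus: Lagos Records New Cases", "Health", "Health"),
    ("Coronavirus Lockdown Hits Traders", "Business", "Politics"),
    ("Lagos Traffic Update", "Travel", "Travel"),
]

n_sections = 37
chance = 1 / n_sections  # expected accuracy of a uniform random guess

print(f"chance baseline: {chance:.3f}")
print(f"overall accuracy: {accuracy(records):.2f}")
print(f"'buhari' accuracy: {keyword_accuracy(records, 'buhari'):.2f}")
print(f"'coronavirus' accuracy: {keyword_accuracy(records, 'coronavirus'):.2f}")
```

Comparing the model's accuracy against the chance baseline (roughly 2.7% for 37 equally likely sections) is what makes a figure like 69% meaningful for a multi-class task.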
| Re: Chronicle Of A Data Scientist/analyst by cochtrane(m): 1:18pm On Jun 10, 2020*. Modified: 9:48pm On May 31, 2021 |
Finally, this won't be complete without mentioning the resulting confusion matrix. I managed to create a visualization for it. When visualized, we see that for most of the sections, the correct prediction was made. There were a few sections where there were probably not enough values to form a cell. These appear to be "Programming", "Pets", etc. Red cells mean zero. For example, there were no successful predictions for "Webmasters". Light cells mean successful predictions. Most of the cells along the diagonal are light, consistent with the fairly good accuracy obtained. If this catches your interest, you can download the notebook here on my GitHub and play around with it. The code is in R.
|
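For readers who have not met a confusion matrix before, the idea described above can be sketched in a few lines of plain Python. The poster's notebook is in R, so this is only an illustration under assumptions: the three sections and the label lists are made up, and `confusion_matrix` here is a hypothetical helper, not a library function.

```python
from collections import Counter


def confusion_matrix(true_labels, predicted_labels, labels):
    """Rows are true sections, columns are predicted sections."""
    counts = Counter(zip(true_labels, predicted_labels))
    return [[counts[(t, p)] for p in labels] for t in labels]


# Hypothetical labels for five front-page titles
labels = ["Politics", "Health", "Webmasters"]
true_labels = ["Politics", "Politics", "Health", "Health", "Webmasters"]
predicted = ["Politics", "Politics", "Health", "Politics", "Politics"]

matrix = confusion_matrix(true_labels, predicted, labels)
for label, row in zip(labels, matrix):
    print(f"{label:<12}", row)

# Correct predictions sit on the diagonal; a zero on the diagonal means
# no title from that section was predicted correctly.
diagonal = sum(matrix[i][i] for i in range(len(labels)))
overall_accuracy = diagonal / len(true_labels)
print(f"overall accuracy: {overall_accuracy:.2f}")
```

In this toy case the "Webmasters" row has a zero on the diagonal, which is the situation the post describes as a red cell.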
| Re: Chronicle Of A Data Scientist/analyst by Gcool2(m): 2:49pm On Jun 10, 2020 |
Guys, I want to drop two vital pieces of information.
1. Please, I advise everyone to get a Twitter account. I get motivated towards tech every day because of the community of people I surround myself with, both online and offline. You can try using Twitter and follow some tech gurus whose posts share insights on programming. I recommend @DThompsonDev. This guy will simplify every problem you feel you are encountering as a beginner.
2. If you are a beginner, I recommend the "Automate the Boring Stuff" Udemy course as the first step towards learning data science or data analysis. With it, the other things you learn will be easier. I have spoken with a lot of the beginners who started with the Hash Analytics internship, and I can conclude that a lack of Python basics is a setback to understanding the major concepts. Here is the link for the free Udemy course: https://udemycoupons.me/automate-the-boring-stuff-with-python-programming-100-off/
Please follow me on Twitter @DatumCue and let's build each other up; I follow back ASAP. I am a beginner like you, and I want to see us succeed. |
| Re: Chronicle Of A Data Scientist/analyst by Gcool2(m): 2:55pm On Jun 10, 2020 |
Theflint1: |
| Re: Chronicle Of A Data Scientist/analyst by Gcool2(m): 3:02pm On Jun 10, 2020 |
cochtrane:cochtrane, you have done a wonderful job. Well done! You made my day with this insight. Keep it up. I will PM you. |
| Re: Chronicle Of A Data Scientist/analyst by cochtrane(m): 4:24pm On Jun 10, 2020 |
Gcool2:Thanks man |
| Re: Chronicle Of A Data Scientist/analyst by ibromodzi: 9:12pm On Jun 10, 2020 |
cochtrane:Man, you are on another level. Seun should employ you. I'd like to ask what you use for NLP: spaCy, NLTK or TextBlob? |
| Re: Chronicle Of A Data Scientist/analyst by Singingbae(m): 12:19am On Jun 11, 2020 |
mcemmy0z:Good evening sir. Please, I don't know how I can meet you. I reside in Agege, but I can come to Sango one of these weekends if you can tell me which one is comfortable for you. I just finished Python and I just got admitted to a virtual data science boot camp, and I want to be ahead of the others! Thanks in anticipation |
| Re: Chronicle Of A Data Scientist/analyst by Gcool2(m): 1:51am On Jun 11, 2020 |
Visit this site daily for free Udemy courses https://udemycoupons.me/100-off-udemy-coupons-codes-verified-for-today/?amp |
| Re: Chronicle Of A Data Scientist/analyst by cochtrane(m): 6:53am On Jun 11, 2020 |
ibromodzi:The code is written in R, and uses the tm package |
| Re: Chronicle Of A Data Scientist/analyst by ibromodzi: 8:25am On Jun 11, 2020 |
cochtrane:Alright! Thanks. |
| Re: Chronicle Of A Data Scientist/analyst by elunico: 9:38am On Jun 11, 2020 |
Gcool2:That course isn't free. |
| Re: Chronicle Of A Data Scientist/analyst by Gcool2(m): 12:22pm On Jun 11, 2020 |
elunico:They revoked it. It was free; I downloaded a couple of free courses there. |
| Re: Chronicle Of A Data Scientist/analyst by elunico: 4:59pm On Jun 11, 2020 |
Gcool2:OK. |
| Re: Chronicle Of A Data Scientist/analyst by DrinkWater10: 10:55pm On Jun 11, 2020*. Modified: 1:34am On Jun 13, 2020 |
Ejiod: |
| Re: Chronicle Of A Data Scientist/analyst by DrinkWater10: 2:44am On Jun 12, 2020 |
Graspad:Hi. Tried downloading tutorial files from freetutorial website but I keep getting a compressed torrent file. How can I get around it? |
| Re: Chronicle Of A Data Scientist/analyst by Mikechinos(m): 6:14am On Jun 12, 2020 |
DrinkWater10:download it with UC browser |
| Re: Chronicle Of A Data Scientist/analyst by Generalkorex(m): 9:24am On Jun 12, 2020 |
Kaycee54321:Link to video please |
| Re: Chronicle Of A Data Scientist/analyst by BoleAndFish: 9:41am On Jun 12, 2020 |
DrinkWater10:What course were you trying to download? |
