Programming › Re: Chronicle Of A Data Scientist/analyst by Dum20: 4:11pm On Jun 06, 2020 |
Zabiboy:
For DAX expression exercises in Power BI, check wiseowl.co.uk, w3resource.com and kaggle.com. For exercises without DAX expressions, you can use the same sites. Basically, what can be done in Tableau can be done in Power BI, so if you try out any Tableau project, you can replicate it in Power BI too. If you are on Twitter, you can search for #makeovermonday. Every Sunday a dataset is dropped, and we submit our analysis in Tableau on Monday or by Wednesday at the latest. It's a chance to improve yourself. GL
Thank you and God's abundant blessings on you |
Programming › Re: Chronicle Of A Data Scientist/analyst by Dum20: 1:19pm On Jun 06, 2020 |
tensazangetsu20: Work on your CV. Make sure that the keywords from the jobs you are applying for are present on your CV.
Then get lots of good projects that make sense. Bro, where can one get projects to work on?
put them on your portfolio and GitHub and finally get used to rejection. Op himself got over 50 rejections before he got his first job.
Clement, founder of AlgoExpert, applied to over 200 jobs before his job at Google. Just set a target: at least 10 jobs daily, and don't limit your search to Nigeria. Focus on the things you can control.
The great thing about tech is the jobs are there unlike some industries where there aren't any vacancies so you just send out CVs and pray to God.
And lastly learn data structures and algorithms. You will need it for coding interviews. It's something you can learn while applying to jobs. Also join Upwork and try your hands at freelancing. |
Programming › Re: Chronicle Of A Data Scientist/analyst by Dum20: 1:07pm On Jun 06, 2020 |
Zabiboy:
I know I'm not the one this question is directed to, but the truth is Power BI and Tableau are quite straightforward and not difficult to learn. Almost everything has been done already. The only fairly difficult part of Power BI is DAX functions, the counterpart of calculated fields in Tableau. Generally, you can use 2-3 weeks to learn either of them,
and with constant practice, in 1 or 2 months, you should be a pro. Can you suggest a place where one can practice to become a pro? I have taken courses on them but I am not yet a pro for lack of practice.
Like I said, in both of them, most of the work has already been done for you by the developers. The only hindrance is that neither of them can be used to web-scrape data. GL
Thanks bro all responses are welcome. |
Programming › Re: Chronicle Of A Data Scientist/analyst by Dum20: 7:31am On Jun 06, 2020 |
Hardheolar: Hi guys, I did further digging, using a business intelligence tool called Power BI, on the dataset that cochtrane scraped. Note: the analysis is by the date a thread made it to front page, not the date it was created. Those front page dates span from 3rd of June 2019 to 2nd of June 2020. The data has 28,516 threads from 38 different sections by 5025 different accounts.
A few insights from the dataset:
- dre11 is the king of political threads with 361 threads, followed by Islie (329) and ijustdey (225) during the time captured. Lala is more interested in celebrity threads compared to politics, followed by Alex. Ogbiwa is the defending champion of sports threads.
- Threads are mostly pushed to front page in the morning, which is reasonable since that is when the day begins.
- There was a spike in threads pushed to front page in July 2019, but I can't tell if that is the norm for that period since we don't have the previous year's data to make the comparison.
- You would think that threads in the religion section should top the threads that make it to front page on Sundays, but it came third after politics and celebrity threads.
- 853 threads made it to front page with the "kill" keyword. That is worrisome and a cause for concern.
Lots of insight can be derived from the data. Below is a link to the report. It is interactive, so feel free to play around with it.
https:///3gXsRlq Wow, this is fantastic. Very smart analysis. I am impressed. So bro, for those of us coming behind, can you let us know: 1. How long you have been learning programming and data science 2. What courses you took to bring you to this level 3. Any other words of encouragement |
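For anyone who wants to try the same kind of counts in Python instead of Power BI, here is a minimal sketch using only the standard library. The records are made up for illustration, and the field names (`author`, `section`, `title`) are assumptions, not the real columns of cochtrane's dataset.

```python
from collections import Counter

# Made-up sample rows; the real dataset has about 28,000 of these.
threads = [
    {"author": "userA", "section": "Politics", "title": "Gunmen kill two in market"},
    {"author": "userA", "section": "Politics", "title": "Senate passes new bill"},
    {"author": "userB", "section": "Celebrities", "title": "Actress shares new photos"},
    {"author": "userC", "section": "Sports", "title": "Striker scores twice"},
    {"author": "userA", "section": "Crime", "title": "Police arrest suspect"},
]

# Threads per author within one section (here: Politics).
politics_by_author = Counter(
    t["author"] for t in threads if t["section"] == "Politics"
)
print(politics_by_author.most_common(3))

# Threads per section overall.
by_section = Counter(t["section"] for t in threads)
print(by_section.most_common())

# Threads whose title contains a keyword, ignoring case.
kill_count = sum("kill" in t["title"].lower() for t in threads)
print(kill_count)
```

Note that a plain substring check like this would also count titles containing words such as "skill", so a keyword total from it should be read as an upper bound.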
Programming › Re: Chronicle Of A Data Scientist/analyst by Dum20: 4:06pm On Jun 04, 2020 |
brashear: Maybe the CSV file is not in the same directory as Python/Anaconda. You should write the full path to the CSV file in your code.
pd.read_csv('c:user\desktop\filename.csv') Try this: pd.read_csv('c:/user/desktop/filename.csv') Note: use "/" instead of "\". |
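A small sketch of why the backslash version can fail, using only the standard library. In a normal Python string, `\f` is read as a form feed character, so the path is silently changed before pandas ever sees it. The path is the placeholder from the post, not a real file.

```python
from pathlib import PureWindowsPath

# Placeholder path from the post; adjust to wherever your file really is.
broken = 'c:/user/desktop' + '\filename.csv'   # '\f' becomes a form feed character
raw = r'c:\user\desktop\filename.csv'          # raw string keeps every backslash
forward = 'c:/user/desktop/filename.csv'       # forward slashes need no escaping

print('\x0c' in broken)   # the form feed has replaced '\f'
print('\x0c' in raw)      # the raw string has no form feed

# Both safe spellings describe the same Windows path.
same_path = PureWindowsPath(raw) == PureWindowsPath(forward)
print(same_path)
```

With pandas, passing either `raw` or `forward` to `pd.read_csv` should then work, as long as the file really exists at that location.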
Programming › Re: Programming Learning Challenge June. by Dum20: 2:16pm On Jun 04, 2020 |
lalasticlala this thread is also great for front page.
There are many interesting threads in this section |
Programming › Re: SQL Basics & Advanced Topics: Free Webinar! by Dum20: 1:19pm On Jun 04, 2020 |
Sent you a pm |
Programming › Re: Chronicle Of A Data Scientist/analyst by Dum20: 7:56am On Jun 04, 2020 |
lalasticlala: Yes, if there are interesting ones. First of all, thanks for acting on the suggestion. I saw a thread from this section moved to front page. But I would like to say that more can be done. Everything about programming is interesting. Many people are not aware there is a section like that. Check this thread for example: it is very, very educative and is helping people who want to change careers or start a new career. There are many threads like that. So Lala, please do more. |
Programming › Re: Chronicle Of A Data Scientist/analyst by Dum20: 2:28pm On Jun 03, 2020 |
lalasticlala: You tried. Lala, can you push some threads in the programming section to front page once in a while? |
Programming › Re: Chronicle Of A Data Scientist/analyst by Dum20: 2:25pm On Jun 03, 2020 |
cochtrane: I started with self learning. Please can you explain more about starting with self learning? But now I am taking some courses online (edX, Dataquest and DataCamp). Not a lot. I don't even think I am so competent just yet. Still a work in progress. But most of the courses I have taken have been by myself, online. Scraping Nairaland was with BeautifulSoup, and I just used the documentation. Six months, give or take. I have been learning Python for 9 months. Do you advise learning Python first before going into data science, or studying data science courses that have Python as one of the subjects to be learnt? Try to get into the act of doing, not only while you are watching the tutorials, but also at other times. Can you give an example? Most of the courses have exercises. Doing makes you learn a lot. Start with something that interests you. If you like Nairaland, you may actually start with that Nairaland dataset.
- Use Jupyter Notebook. Most of the libraries are pre-installed.
- Import the required libraries.
- Scraping was with BeautifulSoup, and you can consult the documentation online.
- Used pandas to clean the data. It was very dirty.
- Used seaborn to visualize the data. There are other options, including Plotly.
Make sure to practice real-life examples. Can you suggest where to get real-life examples? Thanks for your answers so far. |
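The extract, clean and summarise steps described above can be sketched roughly as follows. This is only an illustration: it uses the standard library's `html.parser` in place of BeautifulSoup, and plain lists in place of pandas, so that it runs without installing anything. The HTML, the `topic` class and the `data-section` attribute are all made up and are not Nairaland's real page structure.

```python
from collections import Counter
from html.parser import HTMLParser

# Hypothetical markup: NOT Nairaland's real HTML, just something to parse.
SAMPLE_HTML = """
<ul>
  <li><a class="topic" data-section="Politics">  Senate passes new bill </a></li>
  <li><a class="topic" data-section="politics">Governor visits market</a></li>
  <li><a class="topic" data-section="Sports">Striker scores twice</a></li>
  <li><a class="topic" data-section="Sports"></a></li>
</ul>
"""


class TopicParser(HTMLParser):
    """Collect [section, title] pairs from <a class="topic"> tags."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._in_topic = False  # True while we are inside a topic link

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("class") == "topic":
            self._in_topic = True
            self.rows.append([attrs.get("data-section", ""), ""])

    def handle_data(self, data):
        if self._in_topic:
            self.rows[-1][1] += data

    def handle_endtag(self, tag):
        if tag == "a":
            self._in_topic = False


# Step 1: extract the raw rows.
parser = TopicParser()
parser.feed(SAMPLE_HTML)
parser.close()

# Step 2: clean (trim spaces, normalise case, drop rows with no title).
clean = [
    (section.strip().title(), title.strip())
    for section, title in parser.rows
    if title.strip()
]

# Step 3: summarise; these counts are what you would then plot.
counts = Counter(section for section, _ in clean)
print(counts.most_common())
```

In a real notebook, step 2 is where pandas would usually come in and step 3 would feed a seaborn or Plotly chart, as in the list above.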
Programming › Re: Chronicle Of A Data Scientist/analyst by Dum20: 10:39am On Jun 03, 2020 |
cochtrane: Thanks! Just drop a message here if you've got a question. Okay, great. 1. Firstly, how did you start? 2. What are the courses you took to get to this stage of competence? 3. For how long have you been in data science? My background: I am taking a Business Intelligence course on Udemy. I have taken the statistics, SQL and Tableau parts of the course. I have just started the Python section. But I still feel inadequate. I have also taken courses on Excel Power tools and Power BI. N.B. Can you give a step-by-step guide on how you got to the final charts above? You do not need to go into details. Something like: 1. You used XYZ to extract data 2. You used ABC to clean the data 3. You used MNOP to visualise the data. Thanks. There are a lot of courses to study. I am wondering if I should stop for now and practice real-life examples of the subjects I have learnt. |
Programming › Re: Chronicle Of A Data Scientist/analyst by Dum20: 8:27am On Jun 03, 2020 |
cochtrane: As a budding data scientist who visits NL often, it's not surprising that you start to get more than interested in some of the topics making front page and how frequently topics from individual sections reach the top. I have been looking into this for a while and thought it would be nice to do some investigation in this regard. For example, which section makes front page most often? How often do we see programming topics get to the front page? Who posts more often on the front page? Is it really lalasticlala, as is frequently supposed, or is it someone else? What exactly has been the relationship between lalasticlala and snakes over the past year? Some people think he loves to push snake topics to the front page more often than other topics. What else can we learn from the topics making front page? For example, are they mostly about Buhari or something else?
To this end, I scraped the front page data and obtained more than 28,000 records. You can download the data set I obtained here on my GitHub. If you are a data science enthusiast who also likes Nairaland, this may be good motivation to dig into a topic that interests you. You will find a metadata file in the sublink as well and can investigate what the attributes are about. You've got titles, links, sections and the time that posts made front page. It's a year of data from 31st May 2019 till date. It turns out that getting the whole front page information may need more than 230,000 records! That's huge, and probably not so wise to collect for a quick, lazy analysis. Unless, of course, you have business motives.
For me, I was interested in a few topics. First, from which section did we get the most frontpage material over the past year? Apparently, it is "Politics". It trumps everything. "Celebrities" come a close second, and then "Crime". "Programming"? Didn't even make bank one time!
Who posts more often on the front page? Not lalasticlala like you might think. It's a person called dre11, at least over the last year. Maybe you know him, maybe you don't. Lalasticlala is not even in the top three.
One quirky thing I found, however, was that the time it takes for a post to get to frontpage has a heavily right-skewed distribution. Before plotting this, I lazily thought it might be normally distributed, cos...well, a lot of things are normally distributed and it shouldn't be unusual to have this normally distributed as well; few make front page early, few late, and most are in between. I feel the heavy skewness probably points to deliberate human intervention. Most posts make front page early, not late. They are created and in little time pushed to the front page. That's what my data shows. Maybe, better insight could be derived though if one scraped randomly over the past several years in order to obtain a truly random sample.
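A quick way to check for this kind of skew without plotting is to compare the mean and the median and to compute a skewness figure. The sketch below uses only the standard library, and the delay values are invented for illustration; they are not taken from the scraped data.

```python
import statistics

# Made-up delays: hours between thread creation and reaching the front page.
# Most are short; a few are very long, like the years-old threads.
delays_hours = [1, 1, 2, 2, 2, 3, 3, 4, 5, 6, 8, 12, 48, 720, 70080]

mean = statistics.mean(delays_hours)
median = statistics.median(delays_hours)

# Skewness (population form): average cubed distance from the mean,
# measured in standard deviations. Positive means a long right tail.
stdev = statistics.pstdev(delays_hours)
skewness = sum(((x - mean) / stdev) ** 3 for x in delays_hours) / len(delays_hours)

print(f"mean={mean:.1f} median={median} skewness={skewness:.2f}")

# With a heavy right tail, the mean is usually pulled well above the median.
print(mean > median)
```

A roughly normal distribution would give a skewness near zero and a mean close to the median, which is the picture the plot ruled out.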
And there were a few threads which made front page late. Very late! In the past year, we have had threads from 8 years ago make frontpage. Yes, 8 years ago! And then there are those that were initially posted 5 years ago before they made front page. Perhaps you can find more if you looked into the data set?
Anyways, getting your hands dirty with a data set is always a good way to learn data analysis. If you need help with navigating this, you can buzz me. Bro this is fantastic and awesome. I have plenty questions for you later |
Programming › Re: Chronicle Of A Data Scientist/analyst by Dum20: 3:38pm On May 30, 2020 |
Let's see how this comes out
|
Programming › Re: Chronicle Of A Data Scientist/analyst by Dum20: 3:33pm On May 30, 2020 |
And this is the last part |
Programming › Re: Chronicle Of A Data Scientist/analyst by Dum20: 3:32pm On May 30, 2020 |
Add this under the first one |
Programming › Re: Chronicle Of A Data Scientist/analyst by Dum20: 3:31pm On May 30, 2020 |
Hi guys,
I believe this will be useful as you try to understand the data science industry.
I could not upload the full image at once |