Simple Banned Word Algorithm.
Simple Banned Word Algorithm. by kambo(m): 3:36am On Jun 18, 2013 |
The algorithm NL uses for capturing banned words is very easy to bypass. This is just a simple algorithm that can catch certain cases; a filter that handles every case would, I think, be very complex. It catches cases like a banned word punctuated with non-alphabetic characters, e.g. if the banned words are ban, banned, banner and the user types ba.n, ban.ned, etc.

Algorithm:
------------
- Split the sentence into words.
- Remove non-alphabetic characters from each word.
- Check each word against the banned word database.
- If there is a match, substitute the banned word with the NL version.

E.g. ba.n -> ban (after removing non-alphabetic characters) -> check against the NL db -> match found -> replace. fu....ck, b.rea.st, etc. are all in the net now.
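The steps above could be sketched roughly like this in Python. This is only a minimal sketch: `BANNED_WORDS` stands in for the banned word database, and `censor` is a hypothetical placeholder for whatever "NL version" the site actually substitutes (the post does not say what it is).

```python
import re

# Stand-in for the banned word database (the example words from the post).
BANNED_WORDS = {"ban", "banned", "banner"}

def censor(word):
    """Hypothetical 'NL version' of a banned word: same length, all asterisks."""
    return "*" * len(word)

def filter_sentence(sentence):
    """Split into words, strip non-alphabetic characters, check, and replace."""
    result = []
    for token in sentence.split():
        # Remove everything that is not a letter, e.g. "ba.n" -> "ban".
        stripped = re.sub(r"[^A-Za-z]", "", token).lower()
        if stripped in BANNED_WORDS:
            result.append(censor(stripped))
        else:
            # Not banned: keep the word exactly as the user typed it.
            result.append(token)
    return " ".join(result)

print(filter_sentence("please ba.n him"))  # please *** him
```

One side effect of this sketch: any punctuation attached to a banned word disappears along with it, since the whole token is replaced.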
Re: Simple Banned Word Algorithm. by sarutobi: 10:08am On Jun 18, 2013 |
Oga, catch these ones: breeast, fucck, boobi, pusi, pussyy, etc. Truth is, no matter how smart these bots are, humans will get smarter. After all, they were designed by humans. Nice post though. 2 Likes
Re: Simple Banned Word Algorithm. by Nobody: 3:25pm On Jun 18, 2013 |
sarutobi: Oga, catch these ones:
This would need a complex regex.
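One way such a regex could look, as a rough sketch only: build a pattern from each banned word that lets every letter repeat and allows non-letter, non-space characters between letters. The helper name `loose_pattern` is made up for illustration. It handles stretched letters and inserted punctuation, but not misspellings that drop or swap letters.

```python
import re

def loose_pattern(banned_word):
    """Compile a forgiving pattern for one banned word (hypothetical helper)."""
    # Each letter may be repeated one or more times: "ban" -> b+, a+, n+.
    letters = [re.escape(ch) + "+" for ch in banned_word.lower()]
    # Between letters, allow any run of characters that are neither letters
    # nor whitespace (dots, dashes, digits, ...).
    body = r"[^A-Za-z\s]*".join(letters)
    # The lookarounds stop matches inside longer words such as "urban" or "banana".
    return re.compile(r"(?<![A-Za-z])" + body + r"(?![A-Za-z])", re.IGNORECASE)

pattern = loose_pattern("ban")
for text in ["ba.n", "baaan", "b-a-n", "banana", "urban"]:
    print(text, bool(pattern.search(text)))
```

The first three inputs match and the last two do not, because the lookarounds require the match to stand alone rather than sit inside a longer word.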
Re: Simple Banned Word Algorithm. by Nobody: 6:47am On Jun 19, 2013 |
sarutobi: Oga, catch these ones:
You be bad guy! I think this will need a whole English dictionary to be put in the database. Every word you type is checked against the database; if it is not found, it is substituted with the nearest one and checked against the banned words. If it matches a banned word, then an action is taken!
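A minimal sketch of that idea, assuming Python's standard `difflib` as the "nearest word" finder. The tiny `DICTIONARY` list, the `BANNED_WORDS` set and the `cutoff` of 0.8 are all placeholders chosen for illustration, not anything NL is known to use.

```python
import difflib

# Tiny stand-in for "a whole English dictionary" (assumption for the example).
DICTIONARY = ["ban", "band", "bank", "banner", "hello", "world"]
BANNED_WORDS = {"banner"}

def nearest_word(word, cutoff=0.8):
    """Return the word itself if it is known, else its closest dictionary word."""
    word = word.lower()
    if word in DICTIONARY:
        return word
    # get_close_matches returns the best matches first; we only need one.
    close = difflib.get_close_matches(word, DICTIONARY, n=1, cutoff=cutoff)
    return close[0] if close else None

def is_banned(word):
    """Check the word, or its nearest dictionary word, against the banned words."""
    return nearest_word(word) in BANNED_WORDS

print(is_banned("bannner"))  # True: nearest dictionary word is "banner"
print(is_banned("helo"))     # False: nearest dictionary word is "hello"
```

The cutoff is the knob to watch: set it too low and innocent words get pulled toward banned ones, set it too high and creative misspellings slip through.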
Re: Simple Banned Word Algorithm. by WhiZTiM(m): 8:16am On Jun 21, 2013 |
Hahaha @sarutobi... funny examples. Though, the best way to build algorithms that take care of cases like that is to use nearest-neighbour heuristics and high-collision-probability LSH (locality-sensitive hashing). Well, a banned-words system is either a lazy or a simple forum filtering system for a revenue-generating website like NL. How about banned words + phrases? Grammar and linguistic systems... start by implementing primitive systems like that. It's hard, but good for commercial stuff. Seun has a lot of comment data on NL; we could build such probabilistic filtering systems and make them learn the kind of grammar they should ban from a set of comments containing insults.
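For a feel of what LSH-based nearest-neighbour lookup might look like here, below is a rough sketch using MinHash over character bigrams with banding. It is one possible instance of locality-sensitive hashing, not necessarily the one meant above; `NUM_HASHES`, `BANDS` and all function names are assumptions picked for the example.

```python
import hashlib
from collections import defaultdict

NUM_HASHES = 16              # signature length (assumed)
BANDS = 8                    # number of bands (assumed)
ROWS = NUM_HASHES // BANDS   # rows per band

def shingles(word, n=2):
    """Set of character n-grams, padded so word boundaries count too."""
    padded = f"^{word.lower()}$"
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}

def _hash(seed, shingle):
    """Deterministic seeded hash (md5 is used only for mixing, not security)."""
    digest = hashlib.md5(f"{seed}:{shingle}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")

def minhash(word):
    """MinHash signature: the minimum hash of the shingle set, per seed."""
    grams = shingles(word)
    return tuple(min(_hash(seed, g) for g in grams) for seed in range(NUM_HASHES))

def band_keys(signature):
    """Cut the signature into bands; each band is one bucket key."""
    return [(b, signature[b * ROWS:(b + 1) * ROWS]) for b in range(BANDS)]

def build_index(banned_words):
    """Put every banned word into one bucket per band."""
    index = defaultdict(set)
    for word in banned_words:
        for key in band_keys(minhash(word)):
            index[key].add(word)
    return index

def candidates(index, word):
    """Banned words sharing at least one bucket with the given word."""
    found = set()
    for key in band_keys(minhash(word)):
        found |= index.get(key, set())
    return found

index = build_index(["banner", "banned"])
print(candidates(index, "bannner"))  # always contains "banner"
```

"bannner" has exactly the same bigram set as "banner", so the two always land in the same buckets. Words that only partly overlap collide with some probability that grows with their similarity, so any candidates should still be confirmed with an exact similarity check before taking action.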