--- Day changed Mon Sep 22 2008
00:00 < kanzure> Where's my login link?
00:03 < kanzure> yay
00:03 < kanzure> http://designworld.matweb.com/search/DataSheet.aspx?MatID=1876&ckck=1
00:03 < kanzure> a number :)
00:05 < kanzure> goes up to 80920
00:05 < kanzure> http://designworld.matweb.com/search/DataSheet.aspx?MatID=80920&ckck=1
00:05 < kanzure> which seems to be a material called 'rswtest rswtest rswtest' (a polyethylene, no less)
00:09 < kanzure> yay, I'm so happy
00:09 < kanzure> I've never seen a site with such divergent subdomains
00:09 < kanzure> one's using numeric identifiers, another's using plaintext, another's using MD5, what's going on there?
00:10 < fenn> subcontractors
00:10 < kanzure> wahhh
00:10 < kanzure> they have rate limiting
00:10 < fenn> better get some proxies
00:11 < kanzure> they rate limited me after about 100 pages
00:11 < kanzure> so for 80,000 pages, I need 800 proxies
00:11 < kanzure> hrm
00:11 < kanzure> where am I going to get 800 bots?
00:11 < fenn> you dont need 800?
00:12 < kanzure> isn't that peanuts or something in the russian financial information realm?
00:12 < kanzure> what?
00:12 < kanzure> well, I guess, if I can change each of their IP's a few times
00:12 < fenn> rate limiting doesnt mean they just cut you off does it?
00:12 < kanzure> they just cut *me* off
00:12 < kanzure> I don't know what their rate limit is. hrm. have to figure that out
00:14 < kanzure> 'Your IP Address has been restricted due to excessive use. The problem may be compounded when an IP address may be shared by many people in a company or through an internet service provider. We apologize for any inconvenience. Registered and Premiums users are allowed a higher hit limit. If you are a registered or premium user, and you are accessing MatWeb as an anonymous user (not logged in), try logging on, and see if that fixes this problem. If you
00:14 < kanzure> logging in does not fix of course
00:15 < kanzure> ip ban site-wide.
00:15 < kanzure> ideas?
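[Editor's sketch] The back-of-envelope estimate above (about 100 pages per IP before the block, roughly 80,000 pages in total) can be checked with a few lines of Python. The function names are hypothetical, and the 10-second delay in the last call is only an assumed, illustrative pacing value, not a measured MatWeb limit.

```python
import math


def ips_needed(total_pages: int, pages_per_ip: int) -> int:
    """Number of distinct IPs needed if each IP is cut off after pages_per_ip pages."""
    return math.ceil(total_pages / pages_per_ip)


def crawl_days(total_pages: int, seconds_per_page: float) -> float:
    """Total wall-clock days for a single client fetching one page every seconds_per_page."""
    return total_pages * seconds_per_page / 86_400  # 86,400 seconds in a day


# Figures quoted in the log: ~80,000 pages, blocked after ~100 pages.
print(ips_needed(80_000, 100))
# The highest MatID seen in the log was 80920.
print(ips_needed(80_920, 100))
# Assumed pacing of 10 seconds per page (illustrative only).
print(round(crawl_days(80_000, 10), 2))
```

With the log's numbers the first call gives 800, matching the "800 proxies" figure; the estimate drops proportionally if each proxy can change its IP a few times.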
00:16 < fenn> how many pages are there? 80k?
00:16 < kanzure> yes
00:17 < fenn> no ideas
00:17 < kanzure> rate limit of 10 sec per page would lead to a total download time of 9 days
00:17 < kanzure> I guess it's worth it.
00:20 < kanzure> if I split up the queue would you be willing to do a part?
00:36 < kanzure> http://heybryan.org/mediawiki/index.php/Matweb
00:43 < kanzure> I'm so lazy.
00:44 < willPow3r> everybodys like that today
00:46 < kanzure> Hm?
00:46 < kanzure> why's that?
00:49 < kanzure> willPow3r: how about you choose a packet to do? :)
00:49 < willPow3r> whats going on?
00:49 < willPow3r> in 7 words or less
00:49 < willPow3r> too lazy to read more
00:50 < kanzure> click link, click link where not taken
00:50 < willPow3r> ok need link
00:50 < kanzure> http://heybryan.org/mediawiki/index.php/Matweb
00:51 < willPow3r> where do i start?
00:52 < willPow3r> fuck, i'm on vista. no cygwin. need to get out old laptop for linux brb
00:52 < kanzure> hah :)
00:53 < willPow3r> i need a 5th monitor
00:53 < kanzure> rawr
01:02 < kanzure> of course the 'email back' method is flawed
01:02 < kanzure> since it's a 2.3 GB data set
01:02 < kanzure> hrm
01:02 < kanzure> hey, you think they block Google?
01:03 < kanzure> http://72.14.205.104/search?q=cache:5KC6LX9RHY0J:asia.matweb.com/search/SpecificMaterial.asp%3Fbassnum%3DMA1199O+site:matweb.com+IP+address&hl=en&ct=clnk&cd=2&gl=us&client=iceweasel-a
01:03 < kanzure> guess so
01:03 < kanzure> but doesn't block the Wayback Machine.
01:03 < ybit> " 06:03 kanzure : markhamalainen: There's a few people doing STM DNA sequencing for dirt cheap, that's an avenue I've been looking into. " i don't think he ever responded, i think it was due to my dumb question
01:03 < kanzure> Huh?
01:03 < kanzure> You didn't say a dumb question.
01:03 < ybit> in the ustream chat
01:03 < kanzure> Did you?
01:04 < ybit> well, it wasn't dumb, but it got in the way of him responding to you i think
01:13 < kanzure> http://zachtronics.emala.net/pivot/entry.php?id=18 manufactoid game
01:23 < kanzure> hm
01:23 < kanzure> they block at n=100, no matter how long you wait between downloads
01:35 < willPow3r_> kanzure, what was the link to that project you're working on?
01:37 < kanzure> willPow3r_: Which project?
01:37 < kanzure> there are many :)
01:39 < willPow3r_> you had a bunch of chunks of data to process or something
01:39 < kanzure> http://heybryan.org/mediawiki/index.php/Matweb
01:39 < kanzure> They block you after you download 100 of them
01:39 < kanzure> so I'm still thinking about how to do this.
01:39 < willPow3r_> tor
01:39 < willPow3r_> use a dynamic ip
01:40 < kanzure> willPow3r_: I've never figured out how ot use tor
01:40 < kanzure> any hints?
01:40 < kanzure> I mean, I've never had my introduction to tor
01:41 < willPow3r_> basically it bounces all of your network traffic through a bunch of relay servers first
01:41 < willPow3r_> making your ip basically irrelevant and untraceable
01:41 < willPow3r_> matweb is blocking your ip i would assume
01:41 < kanzure> oh, I know what it /is/ but not how to use it or where to download stuffs :)
01:42 < kanzure> yes
01:42 < willPow3r_> oh right
01:42 < willPow3r_> do you have to log in to their site to d/l?
01:42 < kanzure> Nope.
01:45 < willPow3r__> forgot i had vista set as my network bridge
01:46 < willPow3r__> anyway
01:46 < kanzure> ?
01:47 < willPow3r__> since they're blocking your ip, you just need to anonymize it and they wont be able to track your usage
01:47 < kanzure> right
01:47 < kanzure> but how?
01:53 < fenn> set your http proxy to one of these IP's? http://proxy.org/tor.shtml
01:53 < kanzure> Will it always give me a new Ip on each page reload ?
01:53 < kanzure> erm
01:54 < willPow3r__> http://www.torproject.org/documentation.html.en#RunningTor
01:54 < fenn> File: ~/.profile or ~/.bashrc http_proxy=http://127.0.0.1:8118/
01:54 < fenn> HTTP_PROXY=$http_proxy
01:54 < fenn> export http_proxy HTTP_PROXY
01:54 < kanzure> fenn: what does setting HTTP_PROXY in bash do?
01:54 < fenn> nfc
01:54 < kanzure> is that a global variable that wget allows?
01:54 < kanzure> oh
01:54 < kanzure> :)
01:55 < fenn> oddly enough there's no mention of it in the wget man page
01:56 < willPow3r__> HTTP_PROXY is followed only by certain programs
01:56 < fenn> but it does mention *_proxy environment variables
01:56 < willPow3r__> most don't use it
01:58 < fenn> is it possible to copyright data? like numbers and such
01:58 < fenn> i know the RIAA has made a mockery of copyright but still
01:59 < kanzure> lawyers can very easily say yes IIRC
02:00 < fenn> but it's not like they just made the data from scratch, they got it from somewhere
02:00 < kanzure> companies/manufacturers sending them data sheets
02:00 < kanzure> they will argue that they put the effort into typing it into their format
02:01 < fenn> big deal, aggregation doesn't mean you own the semantic content
02:01 < kanzure> okay
02:01 < kanzure> so
02:01 < fenn> i guess i'm not helping
02:01 < kanzure> heh
02:02 < willPow3r__> you're using wget to retrieve the datasets?
02:02 < kanzure> yeah
02:03 < willPow3r__> need to tunnel downloads through tor
02:04 < kanzure> So how do I use a different IP each time ?
02:04 < kanzure> or each 100 downloads at least
02:04 < willPow3r__> since wget is restarted for each d/l
02:04 < willPow3r__> it should have a different one for each d/l
02:05 < kanzure> even with proxdy = 127.0.0.1:8118 ?
02:06 < fenn> you can check to see if it's working by wgetting from your own server and checking the logs?
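[Editor's sketch] The `http_proxy` / `HTTP_PROXY` exchange above is about a convention rather than a shell feature: exporting the variable only matters to programs that choose to read it. As a hedged illustration in Python (not wget itself), the standard library's `urllib` is one such program. The helper name `with_proxy` and the constant `LOCAL_PROXY_URL` are hypothetical; the address is the one quoted in the log.

```python
import os
import urllib.request

# Local proxy address quoted in the log (127.0.0.1:8118).
LOCAL_PROXY_URL = "http://127.0.0.1:8118/"


def with_proxy(env: dict, proxy_url: str) -> dict:
    """Return a copy of env with both spellings of the HTTP proxy variable set."""
    updated = dict(env)
    updated["http_proxy"] = proxy_url
    updated["HTTP_PROXY"] = proxy_url
    return updated


# Equivalent of the `export http_proxy HTTP_PROXY` lines, for this process only.
os.environ.update(with_proxy({}, LOCAL_PROXY_URL))

# urllib scans the environment for *_proxy variables and reports what it found.
print(urllib.request.getproxies_environment().get("http"))
```

The same dictionary could be passed as the `env` argument of `subprocess.run` to launch a proxy-aware downloader; whether a given tool honours the variable still depends on that tool, which is the point willPow3r__ makes.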
02:06 < kanzure> good idea
02:16 < willPow3r__> tor uses servers on the internet to randomize your ip
02:16 < willPow3r__> doing it from your own machine probably wont work
02:17 < kanzure> hm, I'm filling out a resume builder on the university's website
02:17 < kanzure> they want me to list 'accomplishments'
02:17 < kanzure> have I done world domination yet?
02:18 < willPow3r__> thats not until tomorrow night, pinky
02:18 < kanzure> narf
02:21 < kanzure> http://heybryan.org/resume/2008-09-21_resume.pdf Any more bullshitting that I've missed?
02:32 < kanzure> oh god
02:32 < kanzure> what does a failing LCD look like?
02:32 < willPow3r__> horizontal or vertical lines of solid colors
02:33 < kanzure> this looked like a flashback to microprocessor schematics
02:33 < kanzure> the ones that are so dense that it looks like you're viewing a memory dump in color
02:33 < fenn> snow crash
02:33 < kanzure> hm?
02:34 < fenn> = corrupted video mem
02:43 < willPow3r__> wget uses port 80 right?
02:44 < kanzure> could use whatever it wants
02:44 < kanzure> but HTTP servers usually are on port 80 :)
02:44 < willPow3r__> right
02:45 < willPow3r__> i assume matweb is port 80
02:48 < fenn> sheesh, had to go 5 pages deep in google image search: http://flickr.com/photos/48889110751%40N01/23939804
02:48 < fenn> (a snow crash)
02:49 < kanzure> willPow3r__: yes
02:49 < kanzure> fenn: yeah, except less colorful :)
02:49 < kanzure> flickering every second as if it was trying to refresh, but it was just going back to the same thingy
02:50 < kanzure> shut the lid a few times, ALT+F2'ed over to something else, went back and all was okay
02:51 < kanzure> okay, done bullshitting the resume for now I guess
03:03 < kanzure> weird, man wget randomly drops off when talking about environmental variables for proxies
03:17 < kanzure> Hm, the proxies don't connect
03:53 < kanzure> okay, so tor is pretty slow :)
03:58 < kanzure> crap, it's not randomizing
04:04 < kanzure> http://cuil.com/ has been slamming my servers
04:17 < willPow3r__> tor isn't working?
04:20 < willPow3r__> what is this data for? some kind of autoengineering program?
04:26 < kanzure> yes
04:26 < kanzure> a nonexistant program of course
04:26 * kanzure is working on that particular problem
04:26 < kanzure> tor is working now, seems to be randomizing too
04:26 < kanzure> argh
04:26 < kanzure> wait, yeah, it's not randomizing
04:28 < willPow3r__> does it only switch the ip when the tor proxy restarts?
04:28 < kanzure> no, it seems to switch randomly
04:29 < kanzure> for instance, two successive wgets have the same ip
04:29 < kanzure> yet, it's clearly not the one I started with when I tested earlier
04:30 < kanzure> what do I do with http://proxy.org/tor.shtml ?
04:31 < kanzure> maybe firefox's switchproxy is using a mechanism I could steal
04:32 < kanzure> wait, no, switchproxy doesn't do it
07:43 < willPow3r__> brb
08:03 < fenn> wouldnt this be nice: http://russnelson.com/dmca-hacking.html
12:35 < kanzure> http://www.lulu.com/content/182342 China's upcoming problems, in the form of a short sf book or something
12:35 < kanzure> haven't actually looked at it, but I scoped it off of the wta-talk mailing list
12:36 < kanzure> I've asked #tor, and I've been googling around, nobody seems to know how to force tor/privoxy to change IPs.
12:36 < kanzure> forcing a reload doesn't do the trick either
12:41 < nsh> wta-talk?
16:10 < kanzure__> Hi all.
16:10 < user1> oi kanzure__
16:11 -!- user1 is now known as ybit
16:11 -!- ybit is now known as ybit-school
16:11 < kanzure__> Does anyone ever forget from which terminal they are typing?
16:11 < kanzure__> I think this counts as school. Not sure.
16:11 < ybit-school> heh
16:12 < kanzure__> http://w-shadow.com/blog/2008/06/20/tor-how-to-new-identity-with-php/
16:12 < kanzure__> Take a look at the signal NEWNYM request.
16:12 < kanzure__> There's a port that tor sits on waiting for commands?
16:12 < kanzure__> Isn't that a bad idea? Anyone? procto? ;-)
16:17 < kanzure__> How to force a new identity: http://ubuntuforums.org/showthread.php?t=558051
16:17 < kanzure__> Also, http://heybryan.org/mediawiki/index.php/Bioreactors - added a link that Charlie scoped for the design of bioreactors with strands of fiber optic cables
16:17 < kanzure__> Not sure if that's the best way to distribute the photons over the surface area for the algaes and so on
16:19 < ybit-school> what's wrong with the php script?
16:19 < ybit-school> seems fine
16:19 < kanzure__> php script isn't the problem
16:19 < kanzure__> Say you have an open port
16:19 < kanzure__> on which you're allowing people to send commands to the tor daemon
16:20 < kanzure__> generally this is not a good idea for any sort of program
16:20 < kanzure__> but more worrisome would be something like tor
16:21 < ybit-school> i see
16:22 < kanzure__> anywho, I think telnet | "AUTHENTICATE\nsignal NEWNYM\nquit" should do the trick
16:22 < kanzure__> erm, telnet 127.0.0.1 9051
16:22 * kanzure__ had to bitbang his way to find his laptop's ip address .. it was only a few numbers off from his server :)
16:22 < ybit-school> "and you can download the zip file to get everything at once. This way, you don't have to individually click each one. " going soft on everyone, aye? :) (by not making them learn wget or similar)
16:23 < kanzure__> not yet quite in a position to be making such godly demands on people :p
16:23 < kanzure__> "linux! gasp!"
16:24 < kanzure__> aren't people in a university environment supposed to know linux? why am I sitting on an XP machine?
16:24 * ybit-school too is on an xp machine unfortunately :|
16:25 < ybit-school> are you sshing into your computer?
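[Editor's sketch] The telnet idea above (connect to 127.0.0.1 port 9051 and send AUTHENTICATE, signal NEWNYM, quit) might look roughly like this in Python. This is a minimal sketch, assuming a Tor control port on the address from the log with either no authentication or password authentication configured; the function names are hypothetical. Control-port commands are terminated with CRLF, and Tor may rate-limit how often NEWNYM takes effect.

```python
import socket
from typing import Optional


def build_newnym_commands(password: Optional[str] = None) -> bytes:
    """Build the three control-port commands from the log, CRLF-terminated."""
    if password is None:
        auth = "AUTHENTICATE"
    else:
        # Passwords are sent as a quoted string; escape backslashes and quotes.
        escaped = password.replace("\\", "\\\\").replace('"', '\\"')
        auth = f'AUTHENTICATE "{escaped}"'
    return "\r\n".join([auth, "SIGNAL NEWNYM", "QUIT", ""]).encode("ascii")


def request_new_identity(host: str = "127.0.0.1", port: int = 9051,
                         password: Optional[str] = None,
                         timeout: float = 10.0) -> str:
    """Send the commands and return the raw replies (one status line per command)."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(build_newnym_commands(password))
        chunks = []
        while True:
            data = sock.recv(4096)
            if not data:  # the daemon closes the connection after QUIT
                break
            chunks.append(data)
    return b"".join(chunks).decode("ascii", "replace")


# Example (needs a running Tor daemon with its control port enabled):
#     print(request_new_identity())
```

On kanzure__'s worry about an open command port: binding the control port to 127.0.0.1 and requiring authentication limits who can send these commands.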
16:25 < kanzure__> sshing into a few things
16:26 < kanzure__> I think I might be on my server right now
16:26 < kanzure__> not sure, I am also very likely on my other machine in the dorm
16:26 < kanzure__> hrm
16:26 < kanzure__> maybe I'm on the korea box
16:26 < ybit-school> heh
16:26 * kanzure__ feels like he's playing endgame:singularity
17:13 < kanzure__> Anybody know what to send to the commandport to figure out which ip tor is currently masking you as?
17:13 < kanzure__> instead of me ruthlessly bashing whatismyip.org or my own server for such a purpose ..
17:14 < kanzure__> reason why is that I'm trying to see why matweb immediately blocks my tor usage
17:16 < kanzure__> How would it know to immediately block that ip? That's incredibly unlikely.
17:16 < kanzure__> (yes, I've switched the user agent)
17:20 < kanzure__> yeah, so when I run the script on over my server, the ip address is changing
17:20 < kanzure__> meaning all of these random tor IP's are already blocked? wtf?
17:23 < kanzure__> Aha. Okay. There we go.
17:23 < kanzure__> Weird scripting stuff going on there.
17:23 < kanzure__> (yes, I edited the bashrc files.)
17:24 < kanzure__> at one point I thought that the script might not be working because the script was chowned by bryan but for some reason I'm going around as root, so maybe the script would execute with bryan's permissions on /bin/bash
17:24 < kanzure__> apparently that wasn't it.
17:38 < kanzure__> weird, it's randomy blocking me
17:38 < kanzure__> #67 blocked, #68 good, #69 good, #70 blocked, ..
17:39 < kanzure__> Am I really hitting IPs that are blocked? Need to stop it at one where it's blocked and test I guess.
17:45 < kanzure__> does anyone remember how to stop wget from following 302s?
19:07 < kanzure> http://nihroadmap.nih.gov/pioneer/Recipients08.aspx
19:58 < kanzure> http://heybryan.org/school/Calculus/2007-11-13.html <--- The sentence "Z0mg! We're going to crash into the wall of text!" comes to mind. This is generally what lectures are sounding like for me these days. Everybody speaks ambiguously and there's no formal ontology to digest really. Solutions?
19:59 < fenn> show up to school with an introdus gun and start firing logic implants into peoples' brains
20:03 < kanzure> mathml/latex doesn't really do what needs to be done
20:03 < kanzure> video capture is the best you can do if you're lazy
20:03 < kanzure> but then why attend?
20:03 < kanzure> of course, you have to attend to get the video
20:04 < kanzure> but then you have to devote some amount of time in some godforsaken cryptic proportion to the realtime video to decrypt their pathetic excuse of 'formal' education
20:05 < kanzure> it's kind of like the pencil markup complaint
20:05 < kanzure> hard to distinguish the 'here is your equation' and then the 'now we have scribbled on it' states
20:05 < kanzure> while also distinguishing equations, theorems, epistemiological approaches to their activities, etc., which is all densely packed into very, very bad english
20:06 < kanzure> of course the typical case is to just only catch some of the stuff and not all, that's not for me
20:06 < kanzure> I wonder if it would be possible to throw up mathml + latex + ocw + some more eyerolling metamarkup to represent the notes/annotations that people should be making
20:06 < kanzure> then it would be much, much easier
20:07 < kanzure> because you could even write little bots/assistant-agents to guide students in their recording of the material of what to take away
20:14 < kanzure> hm, that ocw dataset for lectures / knowledge could be seeded by metamath.org if they have that in some computationally readable format
20:14 < kanzure> where are my ancient theorem solvers anyway?
20:14 < kanzure> (not mine, actually)
21:57 < willPow3r__> kanzure, according to the tor website, the ip address is changed every 10 minutes
21:59 < spookact> tor? someone planning on hacking the planet?
22:41 < kanzure> spookact: Something like that.
22:44 < kanzure> http://repository.designengineeringlab.org/
22:44 < kanzure> free registration
23:01 < kanzure> hm, Google is speaking tomorrow
23:01 < kanzure> apparently a Tech Talk
23:03 < kanzure> monroe_l@whatever bounced
23:05 < fenn> that wasn't an email
23:06 < fenn> monroe_lee2000@yahoo.com
23:06 < fenn> also see http://sites.google.com/site/teamprometheusnprize/ if you want to be demotivated
23:08 < kanzure> holy shit
23:08 < kanzure> flashback to the 1998
23:08 < kanzure> *to 1998
23:08 < kanzure> all sites used to be like that
23:08 < willPow3r__> angelfire tripod and geocities, yep
23:08 < kanzure> oh man :)
23:09 < kanzure> the holy trinity
23:09 < fenn> it was made with google automatic page creation stuffs
23:09 < kanzure> there was also homestead and maxpages
23:09 < fenn> netzero
23:09 < willPow3r__> lol
23:09 * kanzure was deeply entrinched in the bullshit
23:09 < willPow3r__> nostalgia indeed
23:09 < kanzure> I was a "master hacker" for using angelfire
23:09 < kanzure> anybody who knew HTML was a *god*
23:10 < kanzure> there was this whole knowledge economy kind of developed around the idea
23:10 < kanzure> somehow I had to do favors just to get slanted text with a blinking background
23:10 < willPow3r__> everybody else was using primitive WYSIWYG editors
23:10 < kanzure> well, I was too
23:10 < kanzure> I was like. 10 :)
23:10 < willPow3r__> that inevitably made everything look like myspace
23:11 < willPow3r__> heh
23:11 < willPow3r__> i was in high school
23:12 < kanzure> http://web.archive.org/web/20011030033554/masterhackerhq.homestead.com/
23:12 < kanzure> see?
23:13 < willPow3r__> so *you* were the official master hacker?
23:13 < fenn> you knew all the 1337 tricks for editing URL's
23:13 < kanzure> ye
23:13 < kanzure> stfu n00b etc.
23:13 < kanzure> had an era of that too.
23:14 < fenn> sometimes i wonder if it's a good thing that i lost all my old computer data
23:14 < kanzure> http://web.archive.org/web/20010815195203/http://pub70.ezboard.com/bcodemaster007
23:14 < kanzure> heh
23:14 < kanzure> it is :)
23:14 < fenn> damn zip disks
23:14 < kanzure> it's nice to still have online buddies from 2001 though
23:15 < willPow3r__> i still have a hotmail email from 1996
23:15 < fenn> 1996 eh
23:15 * fenn pulls random date out of pants
23:15 < fenn> 1824, beat that!
23:16 < kanzure> you had a date in your pants?
23:16 < fenn> pocket pussy
23:16 < fenn> i saw it on "8mm"
23:16 < willPow3r__> the fleshlight is the new thing
23:17 < willPow3r__> pocket pussies are a thing of the past
23:17 < kanzure> weren't we worrying a while back about entry points for kids these days?
23:17 < fenn> entry points?
23:17 < kanzure> uhm,
23:17 < fenn> like, how to start your hacking career at age 12?
23:17 < kanzure> sort of
23:17 < kanzure> except not necessarily all into "leet hax0ring" and such
23:18 < fenn> the sad truth is that you'll get sent to guantanamo bay for hacking into the local school district
23:18 * kanzure hides the stolen db of rosters on his hdd
23:18 * kanzure should plug that data into google maps for kicks.
23:18 < kanzure> it had addresses, phone numbers, everything
23:19 < willPow3r__> you should put it on classmates.com
23:19 < kanzure> The annoying ad site?
23:19 < kanzure> they do something other than ads? really?
23:20 < willPow3r__> i'm pretty sure they provide some sort of service
23:20 < willPow3r__> to the technically illiterate
23:21 < fenn> in exchange for all your personal data they'll give you the privelege of viewing PM's people have sent you
23:22 < fenn> i'm amazed at the price variations in sex toys
23:22 < willPow3r__> i thought you had to give them money too
23:22 < fenn> oh, maybe
23:25 < kanzure> anyway, entry points
23:25 < kanzure> I'm not really talking about any real hacking
23:25 < kanzure> but I do mean getting kids hooked on bits and bytes
23:25 < kanzure> steady information diet of some sort.
23:25 < fenn> Extend your penis a solid 1.5? with this unbelievably lifelike Cyberskin sleeve. Comes with Climax Personal Lubricant and Cyberskin Renew ? to keep your sleeve in top condition....
23:26 < kanzure> isn't that what circumcision removes?
23:26 < kanzure> so wouldn't that be redundant for whatever percentage of the population etc.
23:26 < kanzure> guess not 1.5
23:26 < kanzure> anyway
23:26 < fenn> it appears to be a very thick condom, like half an inch thick
23:26 < kanzure> what are you uh doing ?
23:27 < willPow3r__> /b/ is bad for your health
23:27 < kanzure> heh
23:27 < kanzure> it's true
23:28 < kanzure> the #tor goons tell me that I want to trace my tor route to get the destination ip
23:29 < fenn> i love how the sex toy page has this ad banner with disney characters that says "anything's possible. keep thinking."
23:29 < willPow3r__> disneyland is the happiest place on earth. even for paedophiles.
23:29 < kanzure> are we talking about classmates.com ?
23:30 < kanzure> or /b/ ?
23:30 < willPow3r__> lol fenn: STAY AWAY FROM /b/!!
23:30 < fenn> how did /b/ come up?
23:31 < willPow3r__> you started talking about sex toys. so it was assumed you were about to get your fap on
23:31 < willPow3r__> but anyway. entry points
23:32 < kanzure> let's not mix these two together
23:40 < willPow3r__> kanzure, why do you need to run a tracert when you can just hit whatismyip.com
23:40 < willPow3r__> should be the same
23:40 < kanzure> it's not a tracert that I need to run, there's some internal variable I think
23:40 < kanzure> hitting a website takes longer
23:40 < kanzure> I'd like to minimize the number of hits because I'm already going to be stressing things
23:45 < kanzure> man I miss 4chan, it's "FAIL GTFO" then "spontaneously insightful genius posts to clear up everything with humanity, but unfortunately it's off the queue once you realize it"
23:45 < kanzure> oh crap. :) Anyway, back to work.
23:52 < kanzure> oh, balloon launch. good.
23:53 < willPow3r__> ??
23:53 < kanzure> fenn's http://sites.google.com/site/teamprometheusnprize/ link