Bay 12 Games Forum

Pages: 1 ... 7 8 [9] 10 11 ... 16

Author Topic: That webcrawler game, obscene preference confederate maximizer  (Read 14192 times)

freeformschooler

  • Bay Watcher
    • View Profile

I will just say the name can't really be improved much further, I think.
Logged

Aklyon

  • Bay Watcher
  • Fate~
    • View Profile

I cant find le move button D:
Someone do a screencap of the move button,
Spoiler: Ze Button (click to show/hide)
Logged
Crystalline (SG)
Sigtext
Quote from: RedKing
It's known as the Oppai-Kaiju effect. The islands of Japan generate a sort anti-gravity field, which allows breasts to behave as if in microgravity. It's also what allows Godzilla and friends to become 50 stories tall, and lets ninjas run up the side of a skyscraper.

DrPoo

  • Bay Watcher
  • In Russia Putin strikes meteor
    • View Profile

Quote from: Aklyon
I cant find le move button D:
Someone do a screencap of the move button,
Spoiler: Ze Button (click to show/hide)

Le thx, now tell me your emails so I can add yer as committers.. :D
Logged
Would the owner of an ounce of dignity please contact the mall security?

counting

  • Bay Watcher
  • Zenist
    • View Profile
    • Crazy Zenist Hospital

Mine already in my profile.
Logged
Currency is not excessive, but a necessity.
The stark assumption:
Individuals trade with each other only through the intermediation of specialist traders called: shops.
Nelson and Winter:
The challenge to an evolutionary formation is this: it must provide an analysis that at least comes close to matching the power of the neoclassical theory to predict and illuminate the macro-economic patterns of growth

Angel Of Death

  • Bay Watcher
  • Karl Groucho?
    • View Profile

When will this game be ready?
Logged
99 percent of internet users add useless, pulled out of arse statistics to their sig. If you are the 1%, please, for the love of Armok, don't put any useless shit like this in your sig.
Hidden signature messages are fun!

DrPoo

  • Bay Watcher
  • In Russia Putin strikes meteor
    • View Profile

Quote from: Angel Of Death
When will this game be ready?

Way too early to say. We are still waiting for a response from soulwynd, so he can upload his DB and let us code on it.

Quote from: counting
Mine already in my profile.

Yes, but ya know, you can't really see the address, only message it :(
Logged

counting

  • Bay Watcher
  • Zenist
    • View Profile
    • Crazy Zenist Hospital

Quote from: DrPoo
Quote from: counting
Mine already in my profile.

Yes, but ya know, you can't really see the address, only message it :(

I didn't know that before @@... I thought everyone could see that
countingtls@ g m a i l .com
Logged

Stargrasper

  • Bay Watcher
    • View Profile

Just because I didn't see anyone else mention it during my very quick skim over the thread...robots.txt.  There are also some meta tags, and one or two other protocols designed to restrict robots.  Mind, these don't do anything on their own.  They are just the rules set out by the webmaster, and it's up to the robot to follow them.  Don't, and you'll get blacklisted.  For example, the B12 robots.txt completely blocks most bots from touching the forums.  The Wikipedia robots.txt blocks robots from touching dynamically generated pages and also includes a spider trap.  Spider traps are devious little designs used to f**** robots that don't parse the robots.txt file.

I didn't pay that close attention, but it looks like you guys were at least partially serious about making this game.  If so, pay attention to the restriction protocols or you'll find the game ends badly when your bots get blacklisted.

http://www.bay12games.com/robots.txt
http://www.wikipedia.org/robots.txt
http://www.robotstxt.org/
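
For anyone wondering what "following the rules" looks like in practice, here is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text, the bot names (GameCrawler, FriendlyBot) and the example.com URLs are all made up for illustration; none of them are the real Bay 12 or Wikipedia rules.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, invented for this example.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /forum/
Disallow: /private/

User-agent: FriendlyBot
Disallow: /private/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that has already been downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def may_fetch(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Ask the parsed rules whether this user agent may request this URL."""
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    rules = build_parser(SAMPLE_ROBOTS_TXT)
    for agent in ("GameCrawler", "FriendlyBot"):
        for url in ("http://example.com/forum/index.php",
                    "http://example.com/index.html"):
            print(agent, url, may_fetch(rules, agent, url))
```

A real crawler would call set_url() and read() on the parser to download the file from the site first. The check itself stays the same: ask before every request, and skip the page if the answer is no.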
Logged

DrPoo

  • Bay Watcher
  • In Russia Putin strikes meteor
    • View Profile

Fuck.. didn't think about this one. But how does Wikipedia Quest work THEN?
Maybe we could make a bot browser, so it looks like a human browsing..
Logged

FunctionZero

  • Bay Watcher
    • View Profile

Wikipedia only disallows crawling of the special pages. All the articles are fair game.
So Wikipedia Quest could still crawl the regular pages.

So basically it just prevents the bots from looking at pages like Special:Administrators and such.
There's an easily readable list in every robots.txt, so you can see what each site disallows.
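
To show how readable that list is, here is a rough sketch that pulls the Disallow lines out of a robots.txt and groups them by user agent. The sample text and the function name disallowed_paths are made up for illustration, and the parsing is deliberately simplified (it ignores Allow rules and wildcards), so treat it as a way to eyeball the list rather than a full parser.

```python
from collections import defaultdict

# Hypothetical robots.txt content, invented for this example.
SAMPLE_ROBOTS_TXT = """\
# sample rules
User-agent: *
Disallow: /wiki/Special:
Disallow: /w/

User-agent: BadBot
Disallow: /
"""


def disallowed_paths(robots_txt: str) -> dict[str, list[str]]:
    """Return {user agent: [disallowed path prefixes]} from robots.txt text."""
    rules = defaultdict(list)
    current_agents = []   # agents named by the group being read
    seen_rule = False     # True once the current group has a rule line
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()   # drop comments
        if not line or ":" not in line:
            continue
        field, value = line.split(":", 1)
        field, value = field.strip().lower(), value.strip()
        if field == "user-agent":
            if seen_rule:                     # a new group starts here
                current_agents, seen_rule = [], False
            current_agents.append(value)
        elif field in ("allow", "crawl-delay"):
            seen_rule = True                  # part of the group, not listed
        elif field == "disallow":
            seen_rule = True
            if value:                         # empty Disallow blocks nothing
                for agent in current_agents:
                    rules[agent].append(value)
    return dict(rules)


if __name__ == "__main__":
    for agent, paths in disallowed_paths(SAMPLE_ROBOTS_TXT).items():
        print(agent, paths)
```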
Logged

freeformschooler

  • Bay Watcher
    • View Profile

Yeah, should be easy enough. Looks like bay 12 bots can still parse all of the threads and stuff, just not special pages.
Logged

FunctionZero

  • Bay Watcher
    • View Profile

When this gets done, I'm taking over the Poopcrawler's GoogleCode page. :P
Logged

Aklyon

  • Bay Watcher
  • Fate~
    • View Profile

Quote from: freeformschooler
Yeah, should be easy enough. Looks like bay 12 bots can still parse all of the threads and stuff, just not special pages.
As long as they don't act like the wayback machine.
Logged

Stargrasper

  • Bay Watcher
    • View Profile

Quote from: FunctionZero
Wikipedia only disallows crawling of the special pages. All the articles are fair game.
So Wikipedia Quest could still crawl the regular pages.

So basically it just prevents the bots from looking at pages like Special:Administrators and such.
There's an easily readable list in every robots.txt, so you can see what each site disallows.

Which is precisely the reason I linked to two example robots.txt files.

Quote from: DrPoo
Fuck.. didnt think about this one? But how does wikipedia quest THEN work?
Maybe we could make a bot browser, so it looks like a human browsing..

Besides being unethical and quite possibly illegal...it also wouldn't work.  Especially if the webadmin/sysop/etc is monitoring with anything resembling diligence.

Do yourself a favor and just follow the rules.  I had to make a webcrawler find, parse, and listen to robots.txt a couple of years ago and it's not terribly difficult.  Don't forget about the metatags and anything else that might restrict a robot.  Sysops get pissed when unauthorized robots start constantly crawling their pages (hence spidertraps).  It costs them time, money, bandwidth, and other resources every time a bot touches their network.  Even if you follow the rules, sites that don't outright block everything will still block you specifically if you're enough of a nuisance.  Toady will block you in his robots.txt file if you start sucking up too much bandwidth and money. 
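
Since the metatags are easy to forget, here is a small sketch of checking a page's robots meta tag with Python's standard html.parser. The class and function names (RobotsMetaParser, page_permissions) are invented for this example, and it only looks at the common noindex, nofollow and none directives.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").strip().lower() != "robots":
            return
        for directive in attr.get("content", "").split(","):
            directive = directive.strip().lower()
            if directive:
                self.directives.add(directive)


def page_permissions(html: str) -> dict[str, bool]:
    """Say whether a bot may index this page and follow its links."""
    parser = RobotsMetaParser()
    parser.feed(html)
    found = parser.directives
    block_all = "none" in found   # "none" means noindex plus nofollow
    return {
        "index": not (block_all or "noindex" in found),
        "follow": not (block_all or "nofollow" in found),
    }


if __name__ == "__main__":
    page = '<html><head><meta name="robots" content="noindex, nofollow"></head></html>'
    print(page_permissions(page))
```

A crawler would run this on each page it downloads: if "follow" is False, don't queue that page's links, and if "index" is False, don't store the page.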
Logged

Aklyon

  • Bay Watcher
  • Fate~
    • View Profile

So, what I believe is a summary so far:
1: Make sure it understands everything it needs to and restricts itself thusly. (robots.txt, metatags, etc.)
2: Do not make it go too fast. Or do too much at once.
3: Don't use it all the time.
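
Point 2 could be sketched roughly like this: keep track of the last request per host and wait until a minimum delay has passed before hitting that host again. The class name PoliteScheduler and the 5 second default are assumptions for illustration, not a figure anyone in the thread agreed on.

```python
import time
from urllib.parse import urlparse


class PoliteScheduler:
    """Enforce a minimum delay between requests to the same host."""

    def __init__(self, min_delay=5.0, clock=time.monotonic, sleep=time.sleep):
        self.min_delay = min_delay
        self._clock = clock          # injectable so it can be tested
        self._sleep = sleep
        self._last_request = {}      # host -> time of the last request

    def wait_turn(self, url: str) -> float:
        """Block until it is polite to request url; return seconds waited."""
        host = urlparse(url).netloc.lower()
        waited = 0.0
        last = self._last_request.get(host)
        if last is not None:
            remaining = self.min_delay - (self._clock() - last)
            if remaining > 0:
                self._sleep(remaining)
                waited = remaining
        self._last_request[host] = self._clock()
        return waited
```

The crawler would call wait_turn(url) right before every download. If a site's robots.txt gives a Crawl-delay, that value would be a sensible thing to pass as min_delay for that site.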
Logged