It was caused by copyright-infringing files in the Rhodium archive that I did not know about.
Vesp
- Administrator
- Foundress Queen





- Posts: 3,130
salat
- Dominant Queen




- Posts: 276
I wondered cause I had just posted a UN file which should have been ok. That and I am accident prone.
Glad it was resolved.
Salat
amindset
- Larvae

- Posts: 13
Fuck me I panicked when I couldn't get to the site.
Glad everything's OK
Sedit
- Global Moderator
- Foundress Queen





- Posts: 2,099
It happened again? I think the archives are going to pose an issue similar to the one my site caused, and if it's being indexed you can bet this will not be the last time it happens.
akcom
- Dominant Queen




- Posts: 430
Looks like we still don't have a robots.txt so this isn't really all that surprising?
Tsathoggua
- Autistic sociopath
- Foundress Queen





- Posts: 662
What did we miss?
lugh
- Global Moderator
- Foundress Queen





- Posts: 876
Quote
What did we miss?
The Host Monster Suspension of Service Web Page

Those files need to be hosted where their availability can't be interfered with so easily
It's time to consider alternatives 
Shake
- Dominant Queen




- Posts: 276
i must say i shat myself
xxxxx
- Larvae

- Posts: 8
Vesp, you need to add a robots.txt file to the root directory of this site. At the moment there is none, and that allows the files in the archives to be indexed. There is a header set for the forum section only which prevents indexing, but that is a moot point as the forum is visible to members only.
At the moment, if you google "site:thevespiary.org" it shows that there are 706 pages indexed! This is where the copyright problems are coming from.
You need to put the following in a robots.txt file:
Code: [Select]
User-agent: *
Disallow: /
It might take a few weeks for the search engines to recrawl and update their indexes, but the site should be removed from the searches then!
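If you want to sanity-check those two lines before uploading them, here is a minimal sketch using Python's standard `urllib.robotparser`. The helper name `is_blocked`, the `Googlebot` agent string, and the `/index.php` path are just illustrative assumptions, not anything taken from the site's setup. Keep in mind that robots.txt is only a request that well-behaved crawlers honour; it does not restrict access by itself.

```python
from urllib.robotparser import RobotFileParser

# The two-line robots.txt suggested in the post above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""


def is_blocked(robots_txt: str, path: str, agent: str = "Googlebot") -> bool:
    """Return True if a crawler honouring robots_txt should not fetch path.

    `agent` is an illustrative user-agent name; the wildcard rule above
    applies to any agent string.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(agent, path)


if __name__ == "__main__":
    # "/index.php" is a hypothetical path used only for the demonstration.
    for path in ("/", "/Rhodium/", "/index.php"):
        print(path, "blocked" if is_blocked(ROBOTS_TXT, path) else "allowed")
```

With the wildcard `Disallow: /` rule every path should come back as blocked, whereas an empty robots.txt leaves everything crawlable.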
akcom
- Dominant Queen




- Posts: 430
FYI, the only reason they crawl it in the first place is because we've got a front page to crawl from. There is really no reason to have that. It just draws more attention to the highly illegal content of this website.
Vesp
- Administrator
- Foundress Queen





- Posts: 3,130
http://127.0.0.1/robots.txt
Fixed.
Also, the URL of the Rhodium archive has been changed to http://127.0.0.1/Rhodium/ instead of http://127.0.0.1/rhodium/ for a while.
xxxxx
- Larvae

- Posts: 8
Perfect that should work! It will just take a couple of weeks to see the changes in search engines.
I think most of the pages indexed are in the archives. They are being found by Google because they are linked to from other indexed websites. I don't see any links from the front page to the archive, so it isn't really a factor in the pages being indexed.
overunity33
- Subordinate Wasp



- Posts: 218
So who's ready for an offshore private wiki? References will never be a problem again.
akcom
- Dominant Queen




- Posts: 430
Palladium, I think the front page at one point linked to the rhodium archive and that's where it was spidered from. I don't think people are linking back to this site elsewhere?
xxxxx
- Larvae

- Posts: 8
Akcom, search for "site:thevespiary.org" and go down through the results to where there are pages from the archive. You can then search for "link:http://127.0.0.1/rhodium/Rhodium/chemistry/coca.htm" for example to see where Google found that link. For that example, someone has linked to the archive from sciencemadness and Google spidered the archive from there.
akcom
- Dominant Queen




- Posts: 430
Ah, I stand corrected. Not to harp on it, but I still think having that front page is a bad idea. Why draw more attention than we've already got? Every time I come here and I forget to enable TOR it makes me cringe.
Vesp
- Administrator
- Foundress Queen





- Posts: 3,130
I'll get rid of it soon
Terror
- Pupae


- Posts: 62
Good work on the front page changes, just saw it. Looks low key and sleek.
The Lone Stranger
- Subordinate Wasp



- Posts: 198
When I hit go using this site's web address I get the registration page and can't get past it... So what does "Guest 03:49:13 AM Viewing the board index of The Vespiary." mean? Can they see it? If so, it needs curing FAST.
There are ways of asking some spiders (Google and co.) NOT to index sites, and also to delete what they already have stored; maybe only in some countries, but it's possible here. AND there is probably an official forum from the makers of this site's software who will know the best methods for keeping it private and off the radar, so ask them.
Personally I have fuck all to lose, as I have and do nothing, but I don't even want to lose that...
Vesp
- Administrator
- Foundress Queen





- Posts: 3,130
Quote
When I hit go using this site's web address I get the registration page and can't get past it... So what does "Guest 03:49:13 AM Viewing the board index of The Vespiary." mean? Can they see it? If so, it needs curing FAST.
It cannot be seen; a guest will only ever view that login page.
