The Vespiary

Site => News => Topic started by: callejularbo on March 22, 2018, 10:57:46 AM

Title: Hyperlabs released all their PDFs
Post by: callejularbo on March 22, 2018, 10:57:46 AM
Hyperlabs released their huge archive of PDFs. It's worth checking out; I've already found several items of interest.
Not sure how to navigate it effectively though.

Here is the reddit thread containing download links:
https://www.reddit.com/r/DrugNerds/comments/84tx1w/hello_fellow_drugnerds_do_you_remember_rhodium/
Title: Re: Hyperlabs released all their PDFs
Post by: Vesp on March 22, 2018, 08:14:40 PM
Nice. I'm away from the computer at the moment but someone should download it before it is gone, and I will host it here.
Title: Re: Hyperlabs released all their PDFs
Post by: hamsterbob on March 23, 2018, 10:16:42 AM
Nice. I'm away from the computer at the moment but someone should download it before it is gone, and I will host it here.

Got it
Title: Re: Hyperlabs released all their PDFs
Post by: thewire on March 23, 2018, 05:17:02 PM
How the hell is it possible that this was read 5762 times in less than 1 day? ??? :o
Title: Re: Hyperlabs released all their PDFs
Post by: Wizard X on March 27, 2018, 04:37:08 AM
How the hell is it possible that this was read 5762 times in less than 1 day? ??? :o

Because anyone on the WWW can read this thread, not only logged in members.

See attachment for all files in the archive.

Title: Re: Hyperlabs released all their PDFs
Post by: embezzler on March 27, 2018, 09:06:01 AM
Exactly, the news forum is public.
Title: Re: Hyperlabs released all their PDFs
Post by: myCH3 on July 27, 2018, 04:43:11 AM
Is there a better place to download that pdf archive?
Title: Re: Hyperlabs released all their PDFs
Post by: Corrosive Joeseph on July 27, 2018, 04:49:26 AM
Is there a better place to download that pdf archive?

I haven't DL'd it myself but user Mush is a reliable source.......

'hyperlab.info PDF archive' - https://www.sciencemadness.org/whisper/viewthread.php?tid=81606

Plz let me know how you get on......... I wouldn't mind a copy myself



/CJ
Title: Re: Hyperlabs released all their PDFs
Post by: loft on July 27, 2018, 09:24:51 AM
I'm not permitted to access this forum :D
Title: Re: Hyperlabs released all their PDFs
Post by: Corrosive Joeseph on July 27, 2018, 09:30:20 AM
I'm not permitted to access this forum :D

Are you a registered member...........?   PM admin (woelen or Bert) for access to references and whimsy



/CJ
Title: Re: Hyperlabs released all their PDFs
Post by: loft on July 27, 2018, 09:38:28 AM
I am, but my post count is quite low and I only registered recently
Title: Re: Hyperlabs released all their PDFs
Post by: Corrosive Joeseph on July 27, 2018, 09:46:13 AM
Should be no problem........ But here is a copy of the whole post

"hyperlab.info PDF archive


https://www.reddit.com/r/DrugNerds/comments/84tx1w/hello_fel...

password: 'hyperlab.info
https://cloud.mail.ru/public/9daV/dv8Lmv18C
https://cloud.mail.ru/public/JZzz/7rLtwQWTV
https://cloud.mail.ru/public/9y3H/fD6jgMVzK
https://cloud.mail.ru/public/KMCM/efgjNr6HD

"We at hyperlab.info have been sharing information about the synthesis and pharmacology of psychoactive drugs for a long time. It turned out that we have A LOT of info, not only in the form of PDFs, but also as forum posts. We've been working on it, but the forum is mainly in Russian, so it's useful for only a small fraction of humanity.

But the archive of PDFs recovered from forum attachments turned out to be incredibly powerful per se. It's a collection of scientific articles (mostly in English) that you all love, with the main theme being the synthesis and properties of psychoactive drugs and their precursors.

There are more than 4000 individual files. So, how to navigate them?

When you extract the archive, look for the "HLA.htm" file inside the folder. It contains all information about the files which could be obtained automatically (using Citavi, Mendeley and some simple homemade scripts). There is the title of an article, and a file name (which is sometimes useful) as well. The link will open the file from the local archive. The link labeled "=HL=" will lead you directly to download the attachment from the forum, if needed.

The page looks like this: https://imgur.com/a/HwrC5

There is still a whole lot of work to do, but we hope you guys will enjoy it. The archive was made in 2015, but we hope to make a newer version, which will be almost twice as big.

Feel free to share with others. We would be glad to hear any comments."
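
For anyone who would rather search the index than scroll through it, a minimal sketch along these lines could pull the link text and link targets out of "HLA.htm". It assumes the index uses ordinary `<a href="...">` links, as the description above suggests; the real markup is not shown in this thread, and `LinkCollector`, `collect_links` and the sample HTML are made up for illustration.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (link text, href) pairs from an HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []      # finished (text, href) pairs
        self._href = None    # href of the <a> currently open, if any
        self._text = []      # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append(("".join(self._text).strip(), self._href))
            self._href = None


def collect_links(html_text):
    """Parse an HTML string and return its (text, href) pairs in document order."""
    parser = LinkCollector()
    parser.feed(html_text)
    parser.close()
    return parser.links


# Hypothetical sample; the real HLA.htm layout may differ.
sample = (
    '<p><a href="PDF/1221_1.pdf">Some article title</a> '
    '<a href="http://example.invalid/attach">=HL=</a></p>'
)
for text, href in collect_links(sample):
    print(text, "->", href)
```

To run it on the real index, read the file into a string first (the encoding is unknown here, so opening it with `errors="replace"` is the safer choice) and filter the returned pairs by keyword.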



/CJ
Title: Re: Hyperlabs released all their PDFs
Post by: loft on July 27, 2018, 12:42:30 PM
But those are the same links as the ones above? ;D
Title: Re: Hyperlabs released all their PDFs
Post by: Corrosive Joeseph on July 28, 2018, 01:01:47 AM
Then why are people looking for alternative DL's...........?

I never even looked myself........ Did anyone actually get the full thing.........?



/CJ
Title: Re: Hyperlabs released all their PDFs
Post by: loft on July 28, 2018, 07:10:32 AM
Yes I did. Over 4000 PDF files, definitely a huge collection of knowledge. No virus detections or anything similar, in case someone is curious about that.
Title: Re: Hyperlabs released all their PDFs
Post by: Corrosive Joeseph on March 10, 2019, 01:10:09 AM
Nice. I'm away from the computer at the moment but someone should download it before it is gone, and I will host it here.

Okay............. Any chance this could happen.............? Repeated attempts at downloading these have ended in failure every time


/CJ
Title: Re: Hyperlabs released all their PDFs
Post by: dedihetz on March 11, 2019, 04:03:25 AM
Archive.org mirror:
https://archive.org/details/hyperlab.info_PDF_archive_201903


Direct link:
https://archive.org/download/hyperlab.info_PDF_archive_201903/hyperlab_PDF.7z


No password
Title: Re: Hyperlabs released all their PDFs
Post by: Corrosive Joeseph on March 11, 2019, 07:24:32 AM
Thank you very much dedihetz        8)


/CJ
Title: Re: Hyperlabs released all their PDFs
Post by: maker on April 06, 2019, 03:37:59 PM
Lots of papers there. I will need some hours to filter through and get an idea of what I may be able to use.
Good info
Title: Re: Hyperlabs released all their PDFs
Post by: The Lone Stranger on May 14, 2019, 03:35:26 PM
Does anyone know how to get rid of the numbers in front of each paper's title, please? The numbers make it impossible for me to archive it, as my archive is in alphabetical order.
Title: Re: Hyperlabs released all their PDFs
Post by: mathiasxx94 on June 27, 2019, 01:38:29 PM
Does anyone know how to get rid of the numbers in front of each paper's title, please? The numbers make it impossible for me to archive it, as my archive is in alphabetical order.

This sounds like an easy task with a few lines of Python or similar code. I think you should be able to find a solution yourself; if not, I can probably write it later today.
Title: Re: Hyperlabs released all their PDFs
Post by: aes256 on June 27, 2019, 05:07:50 PM
Does anyone know how to get rid of the numbers in front of each paper's title, please? The numbers make it impossible for me to archive it, as my archive is in alphabetical order.

Paste here (or PM me) about a dozen lines of filenames that are representative of the naming scheme and I'll whip out some Python code to rename it for you. If the naming convention is consistent throughout then stripping the numbers is trivial :)
Title: Re: Hyperlabs released all their PDFs
Post by: Wizard X on June 28, 2019, 12:34:03 AM
Does anyone know how to get rid of the numbers in front of each paper's title, please? The numbers make it impossible for me to archive it, as my archive is in alphabetical order.

Paste here (or PM me) about a dozen lines of filenames that are representative of the naming scheme and I'll whip out some Python code to rename it for you. If the naming convention is consistent throughout then stripping the numbers is trivial :)


Look down this post for "See attachment for all files in the archive."

Download: https://www.thevespiary.org/talk/index.php?action=dlattach;topic=15803.0;attach=10658

Title: Re: Hyperlabs released all their PDFs
Post by: mathiasxx94 on June 28, 2019, 12:55:18 AM
Does anyone know how to get rid of the numbers in front of each paper's title, please? The numbers make it impossible for me to archive it, as my archive is in alphabetical order.

Not the cleanest code, but it works okay. I haven't tried it on the other folders yet, but I suppose they are similar. The code fucks up on some of the files because of strange characters, or in some cases because removing the leading numbers produces duplicate file names. The files it can't handle are printed out, though, so you can change them manually since there are so few. It's Python 2.7 since I'm a degenerate weeb, but should probably work with Python 3 too.

Code: [Select]
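from __future__ import print_function  # same print() call on Python 2.7 and 3

import os

path = r"D:\hyperlab_archive\PDF.part4\PDF"  # Just change the path to yours

for root, dirs, files in os.walk(path):
    for filename in files:
        if filename[0].isdigit():
            try:
                # Drop everything up to and including the first underscore.
                first_underscore = filename.find("_")
                new_filename = filename[first_underscore + 1:]
                # Join with root (not path) so files in subfolders are found too.
                old_file = os.path.join(root, filename)
                new_file = os.path.join(root, new_filename)
                if not new_filename or os.path.exists(new_file):
                    raise OSError("empty or duplicate file name")
                os.rename(old_file, new_file)
            except (OSError, UnicodeError):
                # Print the files that were left alone so they can be fixed by hand.
                print(filename)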
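
Since duplicate names are the main failure mode mentioned above, it may help to preview the renames before touching any files. The following is only a sketch of that idea, not part of the posted script: `plan_renames` and `LEADING_NUMBER` are hypothetical names, and the pattern assumes the prefix is always digits followed by one underscore.

```python
import re

# Digits plus an underscore at the start, followed by a non-empty stem
# (so a name like "679_.PDF" is left alone).
LEADING_NUMBER = re.compile(r"^\d+_(?=[^.])")


def plan_renames(filenames):
    """Return (renames, skipped) for one folder's file names without touching disk."""
    taken = set(filenames)           # names that already exist or are already planned
    renames, skipped = [], []
    for name in sorted(filenames):
        new_name = LEADING_NUMBER.sub("", name, count=1)
        if new_name == name or new_name in taken:
            skipped.append(name)     # no numeric prefix, or the target name is in use
        else:
            renames.append((name, new_name))
            taken.add(new_name)
    return renames, skipped


renames, skipped = plan_renames(["1_a.pdf", "2_a.pdf", "b.pdf", "3_b.pdf", "679_.PDF"])
for old, new in renames:
    print("would rename", old, "->", new)
for name in skipped:
    print("would skip", name)
```

Feeding it the `files` list from each `os.walk` step gives a per-folder plan; only the pairs in `renames` would then be passed to `os.rename`.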
       
Title: Re: Hyperlabs released all their PDFs
Post by: aes256 on June 28, 2019, 07:55:02 AM
This function should clean up the filenames pretty well and avoid instances that are too hard:
Code: [Select]
import re


def rename(filename):
    """Strip leading numbers from filenames."""

    # The regex below is used to match and rename filenames like this:
    #   515_55.pdf                          55.pdf
    #   2_Busc_Ber_3_269_269_190_.pdf       Busc_Ber_3_269_269_190_.pdf
    #   1129_00722a060.pdf                  00722a060.pdf
    #   1221_1.pdf                          1.pdf
    #   2220_16_673.pdf                     16_673.pdf
    #   4186_9781593855864.pdf              9781593855864.pdf
    #   4193_51710234_S20Manual.pdf         51710234_S20Manual.pdf
    #   
    # But avoid renaming filenames like these:
    #   679_.PDF
    #   ???????_1983_212-215.pdf
    #   4-Ethoxy-3,5-dimethoxybenzaldehyd- ??????? -- JACS 76, p5555, 1954.pdf
    #   3,4,5-???? -- JACS 74, p4263, 1952.pdf
    REGEX = r'(^[\d]+_)(.+)(\..*)'
    match = re.search(REGEX, filename.strip())
   
    if filename[0].isalpha():
        return
       
    if match and len(match.groups()) >= 3:
        new_filename = ''.join(match.groups()[1:])
        return new_filename

And this is mathiasxx94's code refactored (BUT UNTESTED) to:


I can't be fucked spinning up a Windows VM right now to test it so Your Mileage May Vary  ;)
Code: [Select]
#!/usr/bin/env python2
import io
import os
import re

HYPERLAB_DIRECTORY = r"D:\hyperlab_archive" # Just change the path to yours

def rename(filename):
    """Strip leading numbers from filenames."""

    # The regex below is used to match and rename filenames like this:
    #   515_55.pdf                          55.pdf
    #   2_Busc_Ber_3_269_269_190_.pdf       Busc_Ber_3_269_269_190_.pdf
    #   1129_00722a060.pdf                  00722a060.pdf
    #   1221_1.pdf                          1.pdf
    #   2220_16_673.pdf                     16_673.pdf
    #   4186_9781593855864.pdf              9781593855864.pdf
    #   4193_51710234_S20Manual.pdf         51710234_S20Manual.pdf
    #
    # But avoid renaming filenames like these:
    #   679_.PDF
    #   ???????_1983_212-215.pdf
    #   4-Ethoxy-3,5-dimethoxybenzaldehyd- ??????? -- JACS 76, p5555, 1954.pdf
    #   3,4,5-???? -- JACS 74, p4263, 1952.pdf
    REGEX = r'(^[\d]+_)(.+)(\..*)'
    match = re.search(REGEX, filename.strip())

    if filename[0].isalpha():
        return

    if match and len(match.groups()) >= 3:
        new_filename = ''.join(match.groups()[1:])
        return new_filename

for root, dirs, files in os.walk(HYPERLAB_DIRECTORY):
    for filename in files:
        filename = filename.decode('latin-1')   # This assumes the majority of
                                                # people running this script are Windows users
        original_filepath = os.path.join(root, filename)

        new_filename = rename(filename)
        if new_filename:
            new_filepath = os.path.join(root, new_filename)
            if os.path.exists(new_filepath):
                print('CANNOT RENAME (FILE ALREADY EXISTS): %s' % new_filename)
                continue

            os.rename(
                original_filepath,
                new_filepath
            )
        else:
            print(original_filepath)
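
Because the script is untested, a quick way to see what the pattern does is to run it over the example names from the comments without renaming anything. This is a small standalone check, not part of the script; `strip_prefix` is a hypothetical helper that reuses the same regex.

```python
import re

REGEX = r'(^[\d]+_)(.+)(\..*)'  # same pattern as in the script


def strip_prefix(filename):
    """Return the name without its numeric prefix, or None if the pattern does not match."""
    match = re.search(REGEX, filename.strip())
    return ''.join(match.groups()[1:]) if match else None


examples = [
    '515_55.pdf',
    '2_Busc_Ber_3_269_269_190_.pdf',
    '1221_1.pdf',
    '4193_51710234_S20Manual.pdf',
    '679_.PDF',
    '3,4,5-???? -- JACS 74, p4263, 1952.pdf',
]
for name in examples:
    print('%-40s %s' % (name, strip_prefix(name)))
```

Names for which this prints `None` are the ones the script would leave untouched and list at the end.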
Title: Re: Hyperlabs released all their PDFs
Post by: The Lone Stranger on October 01, 2019, 01:26:50 PM
mathiasxx94 and aes256 ....... Thank you very much. I never imagined that I would one day use Python and am quite excited to try it ...... but not at the moment, as I have other priorities. When I do try it I'll let you know what happens.