LIS Links

First and Largest Academic Social Network of LIS Professionals in India

PDF Metadata Extractor Information Needed

Dear Friends

We are looking for a PDF metadata extractor, preferably one that is free and works offline. If anybody is using such a tool, please share the details. It is intended for use in combination with DSpace, but we cannot put our collection online.

Any help will be highly appreciated.

Thank you

Subeesh A C

Replies to This Forum

Permalink Reply by Baskar Selvaraj on February 16, 2016 at 15:09

Try ExifTool

http://www.sno.phy.queensu.ca/~phil/exiftool/

I have been using it to extract metadata from PDFs for use in DSpace. If you are familiar with the command-line options, you can extract metadata from all your PDFs in one go.

S. Baskar

Permalink Reply by Subeesh A C on February 17, 2016 at 0:25

Thank you very much sir

But when I tried it, the tool seemed to extract data only from the document properties. Are you getting the appropriate data with ExifTool?

Subeesh A C

Permalink Reply by Baskar Selvaraj on February 17, 2016 at 4:45

Hi,

Using the first command below, you can extract all metadata (i.e. every metadata tag associated with each PDF document) from hundreds of PDF documents and save it as a CSV file, which can then be used for batch import into DSpace.

If you need only specific tags, you have to name the required metadata tags in the command. The second command below is an example.

To extract all available metadata tags from the PDF documents and save it as a CSV file

---------------------------------------------------------------------------------------------------------------------

exiftool -csv *.pdf > output.csv

To extract specific metadata tags from the PDF documents and save it as a CSV file

-----------------------------------------------------------------------------------------------------------------------------

exiftool -csv -d %Y-%m-%d -Title -Author -Producer -Subject -Description -Type -Keywords -ISBN -CreateDate -CourseID -FileSize -PageCount -PDFVersion *.pdf > output.csv

Hope this helps.

S. Baskar

LinuXpert Systems
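
The CSV that ExifTool writes has a SourceFile column followed by one column per tag, so it usually needs its headers renamed before DSpace will accept it. The following is a minimal Python sketch of that step, not a tested DSpace workflow. The tag-to-field mapping (TAG_TO_DC), the function name and the collection handle are all assumptions made for illustration; the "+" id for new items and the "||" separator for repeated values follow the convention of DSpace's batch metadata CSV format as I understand it, so please check them against your DSpace version.

```python
import csv
import io

# Hypothetical mapping from ExifTool column names to Dublin Core headers.
# This is an assumption for illustration; adjust it to your own schema.
TAG_TO_DC = {
    "Title": "dc.title",
    "Author": "dc.contributor.author",
    "Subject": "dc.subject",
    "Keywords": "dc.subject",
    "CreateDate": "dc.date.issued",
}


def exiftool_csv_to_dspace(exif_csv_text, collection_handle):
    """Convert `exiftool -csv` output text into DSpace-style CSV text."""
    reader = csv.DictReader(io.StringIO(exif_csv_text))

    # Keep each Dublin Core column once, in first-seen order.
    dc_columns = []
    for dc_field in TAG_TO_DC.values():
        if dc_field not in dc_columns:
            dc_columns.append(dc_field)

    out = io.StringIO()
    writer = csv.DictWriter(
        out, fieldnames=["id", "collection"] + dc_columns, lineterminator="\n"
    )
    writer.writeheader()

    for row in reader:
        # "+" in the id column marks a new item (assumed DSpace convention).
        record = {"id": "+", "collection": collection_handle}
        for tag, dc_field in TAG_TO_DC.items():
            value = (row.get(tag) or "").strip()
            if not value:
                continue  # tag absent or empty for this PDF
            if record.get(dc_field):
                # Several tags feed one field: join with "||".
                record[dc_field] += "||" + value
            else:
                record[dc_field] = value
        writer.writerow(record)

    return out.getvalue()
```

For example, exiftool_csv_to_dspace(open("output.csv", encoding="utf-8").read(), "123456789/2") would return CSV text ready to be saved and reviewed; the handle shown is a placeholder. This covers only the metadata columns, and attaching the PDF files themselves is a separate step.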

Permalink Reply by Baskar Selvaraj on February 17, 2016 at 4:54

ExifTool Tag Names

The tables listed below give the names of all tags recognized by ExifTool.

http://www.sno.phy.queensu.ca/~phil/exiftool/TagNames/index.html

Permalink Reply by Subeesh A C on February 20, 2016 at 19:27

Thank you very much sir

Permalink Reply by Mujib Rahiman K U on February 17, 2016 at 18:34

I created a small utility for extracting information from PDF files a few years ago. It extracts data from all files in a folder and saves it in a tab-delimited text file.

You can try it. I hope it helps. Please let me know.

I have uploaded the program to Google Drive. Click here to download

with regards

Mujib Rahiman

KV Kanjikode

Permalink Reply by Subeesh A C on February 20, 2016 at 19:28

Thanks sir, I will surely let you know.

Regards

Subeesh A C

Permalink Reply by Subeesh A C on February 22, 2016 at 23:38

Sir

I have checked your software; it is a great effort if you coded it yourself. As far as I can see, most software is not able to identify the PDF metadata we require. I think the problem mostly lies in the structure of the PDF files themselves. In my case, the PDF files have no standard structure (and involve OCR) from which an algorithm could extract metadata as it would from a well-formed file. Since we are in a hurry and need more metadata for the current work, we are thinking of indexing the files and filtering them later by category. Anyway, thanks for your reply.
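
One way to see how far the embedded properties fall short is to list which PDFs came out of ExifTool without the fields that matter. The following is a rough Python sketch under stated assumptions: it reads the text of a CSV produced by exiftool -csv (which has a SourceFile column), and the function name and the choice of Title and Author as required fields are mine, not anything from the thread.

```python
import csv
import io

# Assumed list of fields a record needs; change it to suit the collection.
REQUIRED_TAGS = ("Title", "Author")


def files_missing_metadata(exif_csv_text, required=REQUIRED_TAGS):
    """Map each SourceFile to the required tags that are absent or blank."""
    missing = {}
    for row in csv.DictReader(io.StringIO(exif_csv_text)):
        absent = [
            tag for tag in required
            if not (row.get(tag) or "").strip()
        ]
        if absent:
            missing[row["SourceFile"]] = absent
    return missing
```

Files that appear in the result are candidates for manual cataloguing or for the index-and-filter approach described above; files that do not appear already carry the required fields.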
