WORKFLOW for a Ten-Steps Medical Text Analysis Operation using NLP on LPBI Medical and Life Sciences Content
Author: Aviva Lev-Ari, PhD, RN with inputs related to Wolfram NLP Code by Madison Davis and Yash Choudhary
- All INTERNS will work 50% on NLP and 50% on Synthetic Biology
- Training will be offered
- Protocol will be developed
- Software applications will be selected.
- Mid September we will have an Internal Meeting on Mission #2, LPBI India before the Meeting with Dr. Nir
- All Interns need to complete at least ½ an e-Book Ten-Step Workflow Protocol for NLP before starting the Synthetic Biology SW Training
- This is the Ten-Step WORKFLOW Protocol for Medical Text Analysis using NLP at LPBI:
- https://pharmaceuticalintelligence.com/2021/07/15/workflow-for-a-ten-steps-medical-text-analysis-operation-using-nlp-on-lpbi-medical-and-life-sciences-content/
- Each INTERN Completing the 50% assignment on NLP will need to submit this table for his/hers NLP Book Assignment with a Check off mark for each article in each Chapter in the Book the intern was assigned for
- This Table filled in serves as INPUT for QA of the work of the INTERN. Verification is needed for Internship completion for Certification purposes
NLP | Step 1 | Step 2 | Step 3 | Step 4 | Step 5 | Step 6 | Step 7 | Step 8 | Step 9 | Step 10 |
Chapter 1, Article 1 | ||||||||||
Chapter 1, Article n | ||||||||||
Chapter 2 Article 1 | ||||||||||
Chapter 2, Article n | ||||||||||
Chapter 3, Article 1 | ||||||||||
Chapter 3, Article n | ||||||||||
Chapter 4 Article 1 | ||||||||||
Chapter 4, Article n | ||||||||||
Chapter 5, Article 1 | ||||||||||
Chapter 5, Article n | ||||||||||
Chapter n Article 1 | ||||||||||
Chapter n Article n |
Table Source :
Author: Aviva Lev-Ari, PhD, RN, 7/23/2021
This Table is the supporting evidence for:
- (a) obtaining the Completion Certification of the NLP Internship at LPBI, start date, completion date, two signatures
- (b) Handing off of the completed assignment for additional QA Verification
- (c) Submission of PERSONAL PAGE nested under Medical Text Analysis using NLP https://pharmaceuticalintelligence.com/2021-medical-text-analysis-nlp/
- (c) Getting New Assignment in Synthetic Biology in Drug Discovery
- (d) Each Intern assigned to (c) will have a PERSONAL PAGE for recording all the milestones by date, recent at the top. This personal page will be nested under https://pharmaceuticalintelligence.com/synthetic-biology-in-drug-discovery/
LPBI’s WORKFLOW for Medical Text Analysis Operation using NLP on
LPBI Medical and Life Sciences Content
THESE ARE THE INSTRUCTIONS FOR PERFORMANCE OF ALGORITHMS FOR NLP – INSTRUCTIONS ARE AUTHORED BY MADISON DAVIS. CODE FOR BAR DIAGRAM PLOTS, HYPERGRAPH VERSION 1 AND TREE DIAGRAM PLOTS REPRESENT CODE WRITTEN BY YASH CHOUDHARY.
STEP 1: Create Merged .TXT File of Data for Semantic Analysis
- Go to Wolfram Cloud and login (use old or make free account)
- Make a new folder: click “New” then “Folder” in the corner of your dashboard
- Name your folder. This is where you’ll put your text files
- Make a new text file: click “New” then “Text” in the corner of your dashboard
- Name your text file.
- Put this text file in the folder you made.
- Copy and paste the text you want to analyze in the text file. The text can be from one article or multiple articles.
- Verify that the text has been saved by reopening the file. Sometimes you may need to repeat the process of copying and pasting text so it works.
STEP 2: Create Presentation Slides to Copy and Paste Data, Image Graphs Into
STEP 3: View this Document for a Guide on How to Make Semantics Graphs:
- Link
- Add graphs to presentation slides
- The link covers the following algorithms:
- 25 KeyWord Extraction
- Hypergraphs (make one hypergraph per article collection)
- Tree Diagrams (make one tree diagram per article collection)
- Bar Diagrams
STEP 4: Use WordItOut.com and .TXT file per article to generated One WordCloud per article
- Go to WorditOut.com, as seen here: https://worditout.com/
- Click “Create Your Own”
- Copy and Paste the text of the article you want into the text box
- Click “Generate”
- Change the font to “serif”, disable “random on regenerate”
- Change the colors based on the following guidelines, although this is just a suggestion.
- Change the number of colors to 2, color blending method “Direct”
- Save the image
- Upload WordClouds to the Media Gallery and record Article title as Legend and Source for the graph, add your name as image producer and date
- Insert World Cloud in the Article following the Author/Curator’s name
- Place WordCloud in a one PowerPoint Presentation for the entire Article Collection
STEP 5: Transfer Powerpoint to DropBox
STEP 6: Create a .DOCX Interpretation File of Results
- Domain Knowledge Expert generates a .DOCX file with his expert interpretation of all the Insights drawn from the visualization artifacts generated by NLP, ML, AI when all the insights are put together for analysis and synthesis. Store the Expert interpretation into the Interpretation Folder.
- What are the clinical implications for patient treatment
- What are the clinical insights for drug discovery for Big Pharma?
- Are there clues for risk adjustment and policy writing tips for health care insurers?
STEP 7: Transfer copy of Interpretations files for Translation into Foreign Languages: Spanish, Japanese, Russian into Folders with Language Name
STEP 8: Under Construction: Enrichment of the original content with External Repositories
This is very insightful. There is no doubt that there is the bias you refer to. 42 years ago, when I was postdocing in biochemistry/enzymology before completing my residency in pathology, I knew that there were very influential mambers of the faculty, who also had large programs, and attracted exceptional students. My mentor, it was said (although he was a great writer), could draft a project on toilet paper and call the NIH. It can’t be true, but it was a time in our history preceding a great explosion. It is bizarre for me to read now about eNOS and iNOS, and about CaMKII-á, â, ã, ä – isoenzymes. They were overlooked during the search for the genome, so intermediary metabolism took a back seat. But the work on protein conformation, and on the mechanism of action of enzymes and ligand and coenzyme was just out there, and became more important with the research on signaling pathways. The work on the mechanism of pyridine nucleotide isoenzymes preceded the work by Burton Sobel on the MB isoenzyme in heart. The Vietnam War cut into the funding, and it has actually declined linearly since.
A few years later, I was an Associate Professor at a new Medical School and I submitted a proposal that was reviewed by the Chairman of Pharmacology, who was a former Director of NSF. He thought it was good enough. I was a pathologist and it went to a Biochemistry Review Committee. It was approved, but not funded. The verdict was that I would not be able to carry out the studies needed, and they would have approached it differently. A thousand young investigators are out there now with similar letters. I was told that the Department Chairmen have to build up their faculty. It’s harder now than then. So I filed for and received 3 patents based on my work at the suggestion of my brother-in-law. When I took it to Boehringer-Mannheim, they were actually clueless.
This is very insightful. There is no doubt that there is the bias you refer to. 42 years ago, when I was postdocing in biochemistry/enzymology before completing my residency in pathology, I knew that there were very influential mambers of the faculty, who also had large programs, and attracted exceptional students. My mentor, it was said (although he was a great writer), could draft a project on toilet paper and call the NIH. It can’t be true, but it was a time in our history preceding a great explosion. It is bizarre for me to read now about eNOS and iNOS, and about CaMKII-á, â, ã, ä – isoenzymes. They were overlooked during the search for the genome, so intermediary metabolism took a back seat. But the work on protein conformation, and on the mechanism of action of enzymes and ligand and coenzyme was just out there, and became more important with the research on signaling pathways. The work on the mechanism of pyridine nucleotide isoenzymes preceded the work by Burton Sobel on the MB isoenzyme in heart. The Vietnam War cut into the funding, and it has actually declined linearly since.
A few years later, I was an Associate Professor at a new Medical School and I submitted a proposal that was reviewed by the Chairman of Pharmacology, who was a former Director of NSF. He thought it was good enough. I was a pathologist and it went to a Biochemistry Review Committee. It was approved, but not funded. The verdict was that I would not be able to carry out the studies needed, and they would have approached it differently. A thousand young investigators are out there now with similar letters. I was told that the Department Chairmen have to build up their faculty. It’s harder now than then. So I filed for and received 3 patents based on my work at the suggestion of my brother-in-law. When I took it to Boehringer-Mannheim, they were actually clueless.