• Home
  • >>>>>>>>>>>>> 2026 & BEYOND MASTER PLAN >>>>>>>>>>>>>>>>
  • TRAINABLE DATA ASSETS IN CORPORATE LEGACY & ORGANIZATION’s STRATEGIC MEMORY (2012 – 2025)
  • 2.0 LPBI Executive Summary
  • 1.0 LPBI Executive Summary
  • Founder’s Vision
  • Portfolio of IP Assets
  • Knowledge PORTALS System (KPS)
  • 1.0 LPBI – 2012-2020 VISTA
  • LPBI Group’s History
  • 2.0 LPBI – 2021-2025 VISION
  • BioMed e-Series
  • Press Coverage
  • Investor Relations
  • Our TEAM
  • Founder’s Bio
  • Funding, Deals & Partnerships
  • 1.0 LPBI Group News
  • 1.0 LPBI CALENDAR
  • 2.0 LPBI Group News
  • Testimonials about LPBI
  • DrugDiscovery @LPBI Group
  • Medical 3D Printing
  • e-VOICES Podcasting
  • LPBI Newsletters
  • Customer Surveys
  • Health Care INVESTOR’s Corner ($)
  • 2021 Summer Internship Portal
  • 2021-2025 Medical Text Analysis (NLP)
  • Artificial Intelligence: Genomics & Cancer
  • Blockchain Transactions Network Ecosystem
  • 1.0 LPBI Brochure
  • 2.0 LPBI Brochure
  • 2.0 LPBI – Calendar of Zooms
  • Coronavirus, SARS-CoV-2 Portal
  • LPBI India
  • SOP Web STAT
  • Synthetic Biology in Drug Discovery
  • Certificate One Year
  • NFT: Redefined Format of IP Assets
  • Audio English-Spanish: BioMed e-Series
  • Five Bilingual BioMed e-Series
  • Press Releases
  • Intangibles CIM
  • BioMed Audio Podcast Library @LPBI Group
  • Medical School Education: Use of LPBI Group’s Contents
  • ChatGPT + Wolfram PlugIn
  • Medicine with GPT-4 & ChatGPT
  • AGI, generativeAI, Grok, DeepSeek & Expert Models in Healthcare
  • Multimodal Healthcare Foundation Model
  • Cell Therapy in Regenerative Medicine, 2026 and Beyond
  • Contact Us

Leaders in Pharmaceutical Business Intelligence Group, LLC, Doing Business As LPBI Group, Newton, MA

Healthcare analytics, AI solutions for biological big data, providing an AI platform for the biotech, life sciences, medical and pharmaceutical industries, as well as for related technological approaches, i.e., curation and text analysis with machine learning and other activities related to AI applications to these industries.

Feeds:
Posts
Comments

Medical Text Analysis (Deep Learning NLP)

Medical Text Analysis (NLP)

Research Internship

Medical Text Analysis (MTA) with Natural Language Processing (NLP)


Employer Partner: Leaders in Pharmaceutical Business Intelligence (LPBI) Group

Career(s)

Life Science Research, Bioinformatics, AI, Machine Learning, Statistical NLP, semantic Text Analysis, Big Learning

Overview

LPBI Internships offer the following:

  • Affiliation with and mentorship by esteemed scientists and research graduate students.
  • Skills development in NLP, ML, AI applications to Medical Text and Drug Discovery
  • References/letter of recommendation
  • Description of accomplishments and goals achieved during the internship
  • Opportunity to contribute to publications
  • Exploration of opportunities in life sciences in the US
  • Opportunity to collaborate with professionals from various fields such as medicine and Natural Language Processing as well as with other interns.

Internship Description

This research internship introduces students to medical text analysis (NLP).  Students are introduced to an opportunity to learn about curating cutting edge medical articles, learn about methodologies of data curation and data annotation for applications of Natural Language Processing for Text Analysis.

STEP 1:  Domain Knowledge Expert Specifies the selection criteria for a collection of articles:

1.1       Curated & authored articles vs scientific reports

1.2       All articles in a chapter in a book, [N = 1,2,3,  ..,18]

1.3       Selection of articles within a research category [N = 1,2,3,  ..,730]

1.4       Selection of articles within several research categories

STEP 2:  Create .TXT file for each article in the collection

STEP 3:  Create one MERGED .TXT File for all the articles in the collection

STEP 4:  Use WordItOut.com and .TXT file per article to generate one word cloud per article

4.1       Edit Graph – remove connective words

4.2       Upload word clouds to the media gallery and record article title as legend and source for the graph, add your name as image producer and date

4.3       Insert word cloud in the article following the author/curator’s name

4.4       Place word cloud in a one PowerPoint presentation for the entire article collection

STEP 5:  Use .TXT file per article to create a Bar Diagram for the word frequencies in the article

5.1       Edit bar diagram and remove connective words

5.2       Place each bar diagram in the PowerPoint presentation for the article collection

5.3       To generate the bar diagram USE Wolfram CODE and Instructions in DropBox

STEP 6:   Use the one MERGED .TXT file to create ONE Hyper-graph for the entire article collection

6.1       Edit hyper-graph

6.2       Place hyper-graph in the PowerPoint presentation

6.3       To generate the Hypergraph USE Wolfram CODE and Instructions in DropBox

STEP 7:   Use the one MERGED .TXT file to create ONE Tree Diagram for the entire article collection

7.1       Edit tree diagram

7.2       Place tree diagram in the PowerPoint presentation

7.3       To generate the tree diagram USE Wolfram CODE and Instructions in DropBox

STEP 8:   Transfer all visualization in PowerPoint into a Domain Knowledge Expert Interpretation Folder

Types of Students Desired

College students majoring in life sciences or computer science majors

Masters Students will be given additional challenging tasks

Structure/Schedule

  • Cohort/Individual
  • Summer and academic school year internships
  • 16 weeks – (flexible schedule)
  • Zoom 1 time per week with supervisor
  • Zoom 1 time during internship with group
  • Some additional group meetings related to code review, new code instructions, etc.

Internship Project Work Examples

  • Sample project: Curate articles from various medical fields and extract specific information from articles on cancer.*
  • Sample project: Curated Natural Language Processing resources for use in shaping a proof-of-concept pilot project to be used in LPBI Group’s business plan.

Skills Used/Gained

  • Machine Learning (ML)
  • Artificial Intelligence (AI)
  • Applications to medical text analysis
  • Article Classification
  • Creation of Text File Format
  • Merging Text File Formats
  • Creation of word clouds
  • Embedding word clouds in original articles
  • Creation of bar diagrams
  • Generate hyper-graphs from merged files
  • Create tree diagrams from merged files
  • PowerPoint presentation skills
  • File management in Dropbox

* Other articles on genomics, cardiovascular, immunology, infectious diseases, metabolomics, precision medicine and reproductive genomics are available

Verifiable Certifications Offered

  • Medical Text Analysis using statistical NLP, semantic NLP, and Deep Learning

ADDITIONAL INFORMATION

Medical Text Analysis using Wolfram Language for Biological Sciences 

Work on ENTIRE books we published – one book per student [six are assigned already]

18 Books in Medicine

https://lnkd.in/ekWGNqA

Perform Deep Learning NLP on 25 Concepts in each article, as follows

  1. Generate WordClouds for each article in the Book, 
  2. Generate Bar Diagrams for each article in the Book,
  3. Generate Hypergrpahs – ONE for all articles in EVERY chapter in the book – student was assigned
  4. Generate Tree Diagram – ONE for all articles in EVERY chapter in the book – student was assigned
  5. Migrate all artifacts into a Knowledge Graph Flur.ee Blockchain Database – Open source – See #2, below

Trends in Development of Databases and Blockchain | IEEE Conference Publication | IEEE Xplore

https://ieeexplore.ieee.org/abstract/document/9143893
  1. Create a PowerPoint Presentation ONE per Chapter in the Book – SEE attachment [Domain Knowledge Expert Interpretation – Work-in-Progress]
  2. All the PowerPoints will be included in a NEW GENRE of Scientific Book we will publish – Name of student will be on the NLP section 
  3. Structure of the New Book:

Part 1:    eTOCs of the Original Book in English and Spanish: Text & Audio Podcast – Original Authors

Part 2:    Deep Learning Wolfram NLP – SEE attachment – NAME OF STUDENT that produced the NLP results with Wolfram

Part 3:    Editorial [Preface, Introduction, Summary, Epilogue] – English Audio Podcast – Original Authors

Share this:

  • Share on X (Opens in new window) X
  • Share on LinkedIn (Opens in new window) LinkedIn
  • Share on Facebook (Opens in new window) Facebook
  • Print (Opens in new window) Print
  • Email a link to a friend (Opens in new window) Email

Like this:

Like Loading…

  • Follow Blog via Email

    Enter your email address to follow this blog and receive notifications of new posts by email.

    Join 2,056 other subscribers
  • Recent Posts

    • Grok thinks like PhDs – Hybrid Model for Autonomous Update of Doctoral Dissertations with Original PhD Student Author editing and acceptance of Grok’s updated output. 3rd Joint article, a pilot study for Validation of an Autonomous Journal Articles Updating System (AJAUS) tested on Autonomous Update of Aviva Lev-Ari, PhD, RN Doctoral Thesis, UC, Berkeley, 1983. January 31, 2026
    • 2026 World Economic Forum, 1/19 – 1/23/2026, Davos, Switzerland, LPBI Group’s KOLs interpretation of AI in Health Videos January 24, 2026
    • Evolving LPBI Group’s Portfolio of Intellectual Properties (IP): From 2021 Vision to 2026 Reality January 12, 2026
    • Aviva Lev-Ari, PhD, RN – Biography ->> Grokepedia Entry January 11, 2026
    • List of Articles included in the Article SELECTION from Collection of Aviva Lev-Ari, PhD, RN Scientific Articles on PULSE on LinkedIn.com for Training Small Language Models (SLMs) in Domain-aware Content of Medical, Pharmaceutical, Life Sciences and Healthcare by 15 Subjects Matter January 10, 2026
    • Article SELECTION from Collection of Aviva Lev-Ari, PhD, RN Scientific Articles on PULSE on LinkedIn.com for Training Small Language Models (SLMs) in Domain-aware Content of Medical, Pharmaceutical, Life Sciences and Healthcare by 15 Subjects Matter January 10, 2026
    • Collection of Aviva Lev-Ari, PhD, RN Scientific Articles on PULSE on LinkedIn.com January 10, 2026
    • The youngest leader in AI in Health: Tanishq Mathew Abraham, Ph.D. (@iScienceLuvr) January 8, 2026
    • 2026 Grok Multimodal Causal Reasoning on Proprietary Cariovascular Corpus: From 2021 Wolfram NLP Baseline to Thousands of Novel Relationships – A Second Head-to-Head Validation of LPBI’s Domain-Aware Training Advantage January 6, 2026
    • Workflow for Dynamic Linkage and Transition between two Authoring Systems: LPBI Group’s WordPress.com Multi-Authors Authoring System and Microsoft PowerPoint product for Slide show presentation – Part 10.1 in Composition of Methods January 6, 2026
  • Archives

  • Categories

  • Meta

    • Log in
    • Entries feed
    • Comments feed
    • WordPress.org
    • 1 2012pharmaceutical
    • 1 sjwilliamspa

Powered by WordPress.com.

WPThemes.


%d