obtenir tout entre <tag> et </tag> avec php

Question

J'essaye de saisir une chaîne dans une chaîne en utilisant l'expression régulière.

J'ai regardé et regardé mais je n'arrive pas à obtenir les exemples sur lesquels je dois travailler.

J'ai besoin de saisir les balises html <code> et </code> et tout ce qui se trouve entre les deux.

Ensuite, je dois extraire la chaîne correspondante de la chaîne parente, effectuer des opérations sur les deux,

puis remettez la chaîne correspondante dans la chaîne parente.

Voici mon code:

$content = "Lorem ipsum dolor sit amet, consectetur adipiscing elit. &lt;code>Donec sed erat vel diam ultricies commodo. Nunc venenatis tellus eu quam suscipit quis fermentum dolor vehicula.&lt;/code>" $regex=''; $code = preg_match($regex, $text, $matches);

J'ai déjà essayé ces derniers sans succès:

$regex = "/<code\s*(.*)\>(.*)<\/code>/"; $regex = "/<code>(.*)<\/code>/";

piouPiouM · Answer

Vous pouvez utiliser les éléments suivants:

$regex = '#<\s*?code\b[^>]*>(.*?)</code\b[^>]*>#s';

\b garantit qu'une faute de frappe (comme <codeS>) n'est pas capturé.
Le premier motif [^>]* capture le contenu d'une balise avec des attributs (par exemple une classe).
Enfin, le drapeau s capture le contenu avec des sauts de ligne.

Voir le résultat ici: http://lumadis.be/regex/test_regex.php?id=1081

Joe · Answer

$regex = '#<code>(.*?)</code>#';

En utilisant # comme délimiteur au lieu de / parce qu'alors nous n'avons pas besoin d'échapper au / dans </code>

Comme Phoenix l'a indiqué ci-dessous, .*? est utilisé pour rendre le .* ("n'importe quoi") correspond au moins de caractères possible avant de rencontrer un </code> (appelé "quantificateur non gourmand"). De cette façon, si votre chaîne est

<code>hello</code> something <code>again</code>

vous allez faire correspondre hello et again au lieu de simplement faire correspondre hello</code> something <code>again.

moni as · Answer

cette fonction a fonctionné pour moi

<?php function everything_in_tags($string, $tagname) { $pattern = "#<\s*?$tagname\b[^>]*>(.*?)</$tagname\b[^>]*>#s"; preg_match($pattern, $string, $matches); return $matches[1]; } ?>

AlbertoMinetti · Answer

vous pouvez aussi utiliser /<code>([\s\S]*)<\/code>/msU ce catch NEWLINES!

Nate · Answer

function contentDisplay($text) { //replace UTF-8 $convertUT8 = array("\xe2\x80\x98", "\xe2\x80\x99", "\xe2\x80\x9c", "\xe2\x80\x9d", "\xe2\x80\x93", "\xe2\x80\x94", "\xe2\x80\xa6"); $to = array("'", "'", '"', '"', '-', '--', '...'); $text = str_replace($convertUT8,$to,$text); //replace Windows-1252 $convertWin1252 = array(chr(145), chr(146), chr(147), chr(148), chr(150), chr(151), chr(133)); $to = array("'", "'", '"', '"', '-', '--', '...'); $text = str_replace($convertWin1252,$to,$text); //replace accents $convertAccents = array('À', 'Á', 'Â', 'Ã', 'Ä', 'Å', 'Æ', 'Ç', 'È', 'É', 'Ê', 'Ë', 'Ì', 'Í', 'Î', 'Ï', 'Ð', 'Ñ', 'Ò', 'Ó', 'Ô', 'Õ', 'Ö', 'Ø', 'Ù', 'Ú', 'Û', 'Ü', 'Ý', 'ß', 'à', 'á', 'â', 'ã', 'ä', 'å', 'æ', 'ç', 'è', 'é', 'ê', 'ë', 'ì', 'í', 'î', 'ï', 'ñ', 'ò', 'ó', 'ô', 'õ', 'ö', 'ø', 'ù', 'ú', 'û', 'ü', 'ý', 'ÿ', 'A', 'a', 'A', 'a', 'A', 'a', 'C', 'c', 'C', 'c', 'C', 'c', 'C', 'c', 'D', 'd', 'Ð', 'd', 'E', 'e', 'E', 'e', 'E', 'e', 'E', 'e', 'E', 'e', 'G', 'g', 'G', 'g', 'G', 'g', 'G', 'g', 'H', 'h', 'H', 'h', 'I', 'i', 'I', 'i', 'I', 'i', 'I', 'i', 'I', 'i', '?', '?', 'J', 'j', 'K', 'k', 'L', 'l', 'L', 'l', 'L', 'l', '?', '?', 'L', 'l', 'N', 'n', 'N', 'n', 'N', 'n', '?', 'O', 'o', 'O', 'o', 'O', 'o', 'Œ', 'œ', 'R', 'r', 'R', 'r', 'R', 'r', 'S', 's', 'S', 's', 'S', 's', 'Š', 'š', 'T', 't', 'T', 't', 'T', 't', 'U', 'u', 'U', 'u', 'U', 'u', 'U', 'u', 'U', 'u', 'U', 'u', 'W', 'w', 'Y', 'y', 'Ÿ', 'Z', 'z', 'Z', 'z', 'Ž', 'ž', '?', 'ƒ', 'O', 'o', 'U', 'u', 'A', 'a', 'I', 'i', 'O', 'o', 'U', 'u', 'U', 'u', 'U', 'u', 'U', 'u', 'U', 'u', '?', '?', '?', '?', '?', '?'); $to = array('A', 'A', 'A', 'A', 'A', 'A', 'AE', 'C', 'E', 'E', 'E', 'E', 'I', 'I', 'I', 'I', 'D', 'N', 'O', 'O', 'O', 'O', 'O', 'O', 'U', 'U', 'U', 'U', 'Y', 's', 'a', 'a', 'a', 'a', 'a', 'a', 'ae', 'c', 'e', 'e', 'e', 'e', 'i', 'i', 'i', 'i', 'n', 'o', 'o', 'o', 'o', 'o', 'o', 'u', 'u', 'u', 'u', 'y', 'y', 'A', 'a', 'A', 'a', 'A', 'a', 'C', 'c', 'C', 'c', 'C', 'c', 'C', 'c', 'D', 'd', 'D', 'd', 'E', 'e', 'E', 'e', 'E', 'e', 'E', 'e', 'E', 'e', 'G', 'g', 'G', 'g', 'G', 'g', 'G', 'g', 'H', 'h', 'H', 'h', 'I', 'i', 'I', 'i', 'I', 'i', 'I', 'i', 'I', 'i', 'IJ', 'ij', 'J', 'j', 'K', 'k', 'L', 'l', 'L', 'l', 'L', 'l', 'L', 'l', 'l', 'l', 'N', 'n', 'N', 'n', 'N', 'n', 'n', 'O', 'o', 'O', 'o', 'O', 'o', 'OE', 'oe', 'R', 'r', 'R', 'r', 'R', 'r', 'S', 's', 'S', 's', 'S', 's', 'S', 's', 'T', 't', 'T', 't', 'T', 't', 'U', 'u', 'U', 'u', 'U', 'u', 'U', 'u', 'U', 'u', 'U', 'u', 'W', 'w', 'Y', 'y', 'Y', 'Z', 'z', 'Z', 'z', 'Z', 'z', 's', 'f', 'O', 'o', 'U', 'u', 'A', 'a', 'I', 'i', 'O', 'o', 'U', 'u', 'U', 'u', 'U', 'u', 'U', 'u', 'U', 'u', 'A', 'a', 'AE', 'ae', 'O', 'o'); $text = str_replace($convertAccents,$to,$text); //Encode the characters $text = htmlentities($text); //normalize the line breaks (here because it applies to all text) $text = str_replace("
", "
", $text); $text = str_replace("
", "
", $text); //decode the <code> tags $codeOpen = htmlentities('<').'code'.htmlentities('>'); if (strpos($text, $codeOpen)) { $text = str_replace($codeOpen, html_entity_decode(htmlentities('<')) . "code" . html_entity_decode(htmlentities('>')), $text); } $codeOpen = htmlentities('<').'/code'.htmlentities('>'); if (strpos($text, $codeOpen)) { $text = str_replace($codeOpen, html_entity_decode(htmlentities('<')) . "/code" . html_entity_decode(htmlentities('>')), $text); } //match everything between <code> and </code>, the msU is what makes this work here, ADD this to REGEX archive $regex = '/<code>(.*)</code>/msU'; $code = preg_match($regex, $text, $matches); if ($code == 1) { if (is_array($matches) && count($matches) >= 2) { $newcode = $matches[1]; $newcode = nl2br($newcode); } //remove <code>and this</code> from $text; $text = str_replace('<code>' . $matches[1] . '</code>', 'PLACEHOLDERCODE1', $text); //convert the line breaks to paragraphs $text = '<p>' . str_replace("

", '</p><p>', $text) . '</p>'; $text = str_replace("
" , '<br />', $text); $text = str_replace('</p><p>', '</p>' . "

" . '<p>', $text); $text = str_replace('PLACEHOLDERCODE1', '<code>'.$newcode.'</code>', $text); } else { $code = false; } if ($code == false) { //convert the line breaks to paragraphs $text = '<p>' . str_replace("

", '</p><p>', $text) . '</p>'; $text = str_replace("
" , '<br />', $text); $text = str_replace('</p><p>', '</p>' . "

" . '<p>', $text); } return $text; }

Milind Singh · Answer

Vous pouvez également essayer:

function getTagValue($string, $tag) { $pattern = "/<{$tag}>(.*?)<\/{$tag}>/s"; preg_match($pattern, $string, $matches); return isset($matches[1]) ? $matches[1] : ''; }

Il retourne une chaîne vide en cas de non correspondance.